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PREFACE 


This  volume  is  part  of  a  five- volume  set  that  summarizes  the  research  of  participants  in  the  1997  AFOSR 
Summer  Research  Extension  Program  (SREP.) 

Reports  presented  in  this  volume  are  arranged  alphabetically  by  author  and  are  numbered  consecutively  - 
e.g.,  1-1, 1-2, 1-3;  2-1, 2-2, 2-3.  Reports  in  the  five-volume  set  are  organized  as  follows: 
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Deep  Quantum  Well  Channels  for  Ultra  Low  Noise  HEMTs  for  Millimeter  and 

Sub-millimeter  Wave  Applications 

A.  F.  M.  Anwar 
Associate  Professor 

Electrical  and  Systems  Engineering  Department 
The  University  of  Connecticut 
Storrs,  CT  06269-2157 

Abstract 

A  complete  model  that  includes  (a)  quantum  well  calculations  (b)  transport  and  (c) 
noise  for  deep  quantum  well  HEMT  is  presented.  Schroedinger  and  Poisson’s  equations  are  solved 
self-consistently  to  determine  the  quantum  well  parameters  of  the  deep  qxiantum  well  formed  in 
AlGaAsSb/InGaAs/AlGaAsSb.  Time  independent  Bokzman  tran^ort  equation  (BTE)  is  solved 
using  ensemble  Monte  Carlo  to  study  the  velocity-electric  field  characteristic  in  fiixGai.xAs.  It  is  seen 
that  impact  ionization  in  narrow  band  gap  channel  gives  rise  to  a  degradation  in  the  low  field  mobility 
and  peak  velocity.  The  effect  of  impact  ionization  is  modeled  by  a  voltage  dependent  current  source 
in  the  small  signal  equivalent  circuit.  The  ciurent  source  is  characterized  by  an  uiqtact  ionization 
induced  transconductance  wfiose  magnitude  can  be  extracted  from  the  experimental  Y-parameter 
data.  The  solution  of  the  time  dependent  BTE  indicates  that  inpact  ionization  process  is  frequency 
dependent  and  this  dependence  is  also  a  function  of  drain  current  and  source  drain  pacing.  hixGai. 
xASySbi.y  provides  excellent  tranport  properties  and  may  be  used  as  a  channel  material  in  HEMTs. 
The  usefulness  of  the  quaternary  channel  can  be  further  enhanced  if  a  siperlattice  channel  is  used 
instead  of  an  alloyed  channel 


1-2 


Deep  Quantum  Well  Channels  for  Ultra  Low  Noise  HEMTs  for  Millimeter  and 

Sub-miliimeter  Wave  Applications 


L  Introduction 

A  model  for  study  deep  QW  channel  HEMTs  is  presented.  The  theoretical  calculations 
include  a)  self  consistent  solution  of  Schroedinger  and  Poisson’s  equation  and  b)  transport  in  InGaAs 
ternary  channel.  Transport  in  investigated  by  solving  Bohzman  Transport  Equation  (BTE)  using 
ensemble  Monte  Carlo  simulation.  The  solution  of  Schroedinger  and  Poisson’s  along  with  the 
description  of  the  device  along  the  growth  direction  yields  charge  control  Transport  data,  such  as 
low  field  mobihty  and  saturation  velocity  given  the  device  layout  and  charge  control  provides  the  d.c. 
I-V  curves  and  the  small  signal  parameters.  The  availability  of  the  small  signal  parameters  enable  one 
to  model  noise  and  the  device  performance  at  high  frequency. 

In  Sec.2,  the  method  used  to  solve  Schroedinger  and  Poisson’s  is  discussed  along  with 
relevant  results.  In  Sec.3,  the  transport  in  the  channel  material  is  presented.  The  effect  of  impact 
ionization  is  taken  into  account  in  the  transport  study..  In  Sec. 4,  the  modeled  I-V  and  small  signal 
parameters  are  presented.  The  calculation  of  drain  current  takes  into  effect  impact  ionization  in  the 
channel.  In  Sec.5,  noise  in  the  channel  in  devices  is  addressed  theoretically.  In  Sec.6,  the  frequency 
spectrum  of  impact  ionization  is  investigated  by  solving  time  dependent  BTE.  The  use  of  InGaAsSb 
as  the  ideal  chaimel  material  is  investigated  in  Sec.  7.  A  conclusion  is  provided  in  Sec.  8. 

ELI  Quantum  well  and  material  characterization 
ELl.l  Material  Data 

The  calculation  of  conduction  band  discontinuity  AEc  in  AlGaAsSb/InAs  is  reported  by 
Anwar  and  Webster'  and  is  an  extension  of  Schuermeyer  et.  al^.  A  diagram  is  constructed  beghming 
with  known  valence  band  energy  differences  for  binary  systems  and  then  adding  the  bandgap  energies 
to  find  the  conduction  band  minimum  for  each  binary  con:q)ound.  The  energy  bandgap  s  of  the  ternary 
systems  are  computed  by  interpolation  over  alloy  conq)osition  using  bowing  parameters'.  The 
calculation  of  the  energy  bandgap  and  the  lattice  constants  of  the  quaternary  follows  the  work  of 
Moon  et.  al.^,  Ghsson  et.  al^  and  Svennson  et.  al^  The  energy  bandgap  of  the  quaternary  can  be 
written  as: 


1-3 


EG(x,y)=(l-x)Ti4(y)+xT23(y)-A(x,y)  (n.1-1) 

where  Tij(y)  =  yBj  +  (1-y)  Bi  -  y(l-y)Cij  are  the  ternary  alloy  bandgaps,  Bi's  are  the  bandgap  of  the 
binaries,  Cy  are  the  bowing  parameters  for  the  ternary  alloy,  and  A(x,y)  =  x(l-x)[(l-y)Ci2+yC43]  + 

x( l-x)y( l-y)CQ,  where  Cq  =  C^x  +  C23(l-x).  Using  the  parameters  listed  in  Ref*,  EG(x,y)  is 
calculated  for  the  F-,  X-,  and  L-valleys  and  the  lowest  result  is  chosen  as  the  bandgap.  The  lattice 
constant  of  the  quaternary  Lq  is  interpolated  using  the  following  relationsh^: 

Lq  =  L,  +  (L2-L,)x-KL4-L,)y  +  (LrL2+L3-L4)xy  (H.  1-2) 
where  Li's  are  the  lattice  constants  of  the  binaries*.  Finally,  AEc  for  the  lattice  matched  AlxGai-xAsi. 
ySby/InAs  is  determined  by  calculating  the  difference  between  the  conduction  band  energies  of  InAs 
and  AlxGai-xAsi.ySby 

II.1.2_Ad  Envelope  Function  Description  of  Deep  Quantum  Wells 

In  this  section  we  report  a  self-consistent  solution  to  model  the  QW  formed  in  the  conduction 
band  of  an  AlGaAsSb/InAs/AlGaAsSb  heterostructure®.  The  results  of  this  analysis  will  enable  us  to 
compute  the  charge  control,  current-voltage  and  noise  performance  of  this  class  of  devices. 
Furthermore,  this  analysis  will  guide  material,  device  and  fabrication  research  to  achieve  ultra  low 
noise,  very  high  frequency  HEMTs. 


II.1.3  Mathematical  Model 

In  AlGaAsSb/InAs/AlGaAsSb  systems  the  QW  is  formed  in  InAs.  The  one  electron 
Schrbdinger  equation,  under  effective  mass  approximation,  can  be  written  as® 

+(V(>t)-Ei)5i=o  (n.1-3) 


d 

^  1  d^ 

UJ 

dx 

Im* 

where  m*  is  the  electron  effective  mass,  h  is  the  reduced  Planck's  constant,  Q(x)  is  the  envelope  wave 
function,  Ei  is  the  energy  eigen  value,  V(x)  is  the  potential  energy  and  the  subscr^t  ‘T’  denotes  the  i**" 
subband.  For  simplicity  the  potential  energy  flmctions  approximated  by  three  straight  lines  with 
slopes  at,  a2  and  a3,  re^ectively,  and  is  expressed  as 
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V(x)= 


(n.1-4) 


[V„  x<0 

l^ajX+AEj  Xj.i<x<Xj  j=l,2,3 

wiiere  AEi  =  0,  AE2  =  (ai  -  a2)xi,  AE3  =  AE2  +  AEc2  +  L(a2  -  83),  AEc2  is  the  conduction  band 
discontinuity  at  the  second  heterointerface,  L  =  X2  -Xo  is  the  width  of  the  well,  xo  =  0  is  the  position 
of  the  first  heterointerface,  and  X3  is  the  distance  fi:om  the  first  heterointerface  in  which  99%  of  the 
electrons  reside.  The  solution  to  the  Schroedinger  equation,  for  different  regions,  may  be  written  as 

^j(x)=aLoe  +a2,oe  J=0 

^ j  (x)=a  ij Ai((;j  )+a  2 jBi(<;j )  j=  1,2,3 


(n.1-5) 


where  ,  AEd  is  the  conduction  band  discontinuity  at  the  first 

* 

heterointerface,  the  electron  effective  mass  in  AlGaAsSb,  Ai  and  Bi  are  the  Airy  and  the 

/  *  2^^ 

,with  yj=l  2mjaj/ft  1  and  akj’s 


AEj-E 


complementary  function,  respectively,  (ij=yj 

y  “j 

(k=l,2  &  j  =  0, 1,2,3)  are  the  arbitrary  constants.  Here  the  subscript  j  refers  to  region  j  and  the 
superscript  i,  whenever  used,  will  refer  to  the  i**"  subband.  The  eigen  values  and  eigen  functions  are 
determined  by  applying  the  two  boundary  conditions  at  any  inter&ce  (a)  continuity  of  the  wave 
function  and  (b)  continuity  of  the  first  derivative  by  taking  into  account  the  proper  effective  mass. 
Having  formulated  the  Schrodinger  equation  Poisson's  equation  is  formulated; 


2 

dx  i 


(n.1-6) 


* 

where  ngj  l+e^^*^  Ei)/kT'j  ^  number  of  electrons  per  unit  area,  C  is  the  envelope 

wave  function  in  the  hh  subband,  N,  is  the  acceptor  density  in  the  unintentionally  doped  AlGaAsSb 
* 

layer,  mj^ys^s  refers  to  the  effective  mass  in  the  channel,  T  is  the  temperature  and  Ep  is  the  Fermi  level 
at  the  interface  relative  to  the  conduction  band  in  the  channel  at  x=0.  In  these  equations  we  have 
chosen  the  potential  energy  at  the  interface  as  the  reference.  The  Fermi  level  Ep  is  e^ressed  as 
EF  =  q[(l>(0)~<l>(W)]+EFo+AEc2,  ^\llere  (|)(0)  -  <j)(W)  is  the  total  band  bending  in  the  AlGaAsSb 
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layer  <1)(0)  =  0,  W  is  the  depletion  depth, 


E 


Eg(T) 

2 


+kTln 


Na 

ni(T) 


is  the  position  of  the  Fermi 


level  with  respect  to  the  conduction  band  in  the  bulk  AlGaAsSb  and  ni(T)  is  the  intrinsic  carrier 
concentration  in  AlGaAsSb.  The  slopes  aj  of  the  straight  lines,  wldch  approximate  the  shape  of  the 
QW,  are  proportional  to  the  average  electric  field  determined  by  Poisson's  equation.  By  integrating 
Poisson’s  equation  twice  with  respect  to  x,  the  slopes  can  be  expressed  in  the  form 
aj  =  (q"/sX^  ns  +  N,  W),  j=  1,2,3  (H.  1-7) 

where 


fj=l-E— - r  dxf  ?f(x')dx',  j=l,2,3  (n.1-7.1) 

and  ns  =  ^.n5i  is  the  channel  electron  density  in  cm‘^.  By  solving  the  one  electron  Schrodinger 

equation  for  the  given  potential  we  can  obtain  the  eigen  energies  and  the  wave  fimctions  for  the 
system.  Tlie  eigen  energies  and  the  wave  fimctions  determine  the  shape  of  the  electron  distribution 
in  the  quantum  well  which  is  then  used  to  solve  Poisson’s  equation.  The  two  equations  are  solved 
self-consistently  until  we  have  accounted  for  99%  of  the  carriers  in  the  quantum  well. 


11.1.4  Results  and  Discussion 

An  Ino,53  Ga  0.47  As/Alo  6  Gao.4  AsSb  double  barrier  HEMT  with  a  QW  width  of  lOOA  is 
considered.  The  calculated  band  offset  is  0.984eV.  A  low  field  mobility  of  53,000  cmW-s  and  a 
saturation  velocity  of  3.2xl0’  cm/s  is  used  in  the  simulation’.  The  donor  doping  density  of  2x  10‘* 
cm'^  with  a  doped  epUayer  thickness  of  350A  is  used. 

In  Fig.  1,  the  Fermi  level  and  X„  are  plotted  as  a  fimction  of  the  2DEG  concentration.  The 
plots  are  obtained  by  solving  Schroedinger  and  Poisson's  equations  self-consistently.  The  fimctional 
forms  obtained  for  Ef  and  X^v.  For  d.c.,  small  signal  and  noise  analysis  Ef  and  X.v  are  e7q)ressed  in 
fimctional  forms  as  is  evident  in  Secs.  4  and  5. 

in  Transport  in  InGaAs 

In  this  section,  we  mainly  discuss  the  transport  properties  of  In^Ga^xAs.  Monte  Carlo 
technique  to  solve  BTE  is  used  very  successfully  to  simulate  electrons  transport  in  semiconductors. 
The  Monte  Carlo  simulation  includes  the  different  scattering  mechanisms  such  as  acoustic  phonon 
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scattering,  polar  optical  phonon  scattering,  deformation  potential  scattering,  unpurity  scattering, 
alloy  scattering,  piezoelectric  scattering  and  impact-ionization.  This  section  is  concluded  with  Monte 
Carlo  simulation  results. 

ni.l  Monte  Carlo  Method 

Many  parameters  of  a  physical  system  are  governed  by  probability  distributions.  Therefore,  if 
a  mathematically  random  distribution  is  used  to  model  such  distributions  we  can,  in  principle, 
generate  the  physical  values  of  these  parameters.  This  is,  broadly  speaking,  the  Monte  Carlo  method. 
In  practice,  of  course,  the  physical  distribution  may  be  quite  complex  and  difficult  to  manipulate  even 
with  a  computer.  The  manipulations  can  be  simplified  by  “mapping”  the  complex  distributions  on  to 
a  simple  pseudo-random  distribution;  the  most  convenient  pseudo-random  distribution  is  the  uniform 
distribution,  which  is  readily  available  on  most  computer  system.*'* ‘ 

In  general,  if  p((t))  and  p(r)  are  the  respective  probabUity  densities,  associated  with  (j)  in  the 
Physical  distribution  and  r  in  the  pseudo-random  distribution,  then 

♦  t 

jp(<j)')d(j)'=jp(r’)dr’  (HI.  1-1) 

0  0 

In  a  uniform  distribution  p(r)=l  so  that  (ni-1)  becomes 

r  =  J  p((l)' )d<l)  (m.1-2) 

Hence,  provided  that  this  integral  can  be  evaluated  in  simple  closed  analytical  form,  inversion 
will  yield  a  random  value  for  the  physical  variable  (j)  in  terms  of  the  uniformly  distributed  random 
number  r.  A  simple  example  of  this  technique  is  the  generation  of  the  random  flight  time  of  a  classical 
particle  in  a  gas  for  the  case  wfien  T,  the  total  scattering  rate,  is  constant.  The  probability  of  this 
particle  traveling  unimpeded  for  time  t  and  then  being  scattered  at  the  end  of  this  flight  is 

p(t)=re-"  (in.  1-3) 

From  equation  (HI.  1-2) 

r  =  |re'"dt'=  1-e'*"  (m.1-4) 

0 

so  that  the  random  flight  times  are  given  by 
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(in.  1-5) 


t  =  _lin(i-r)  =  -lln(r) 

Note  that,  since  r  is  uniformly  distribution,  In(l-r)  is  equivalent  to  ln(r). 

In  effect,  what  happens  in  the  calculation  is  that  the  cumulative  probability  P(t)  is  calculated, 
Le.  the  area  imder  the  probability  curve  up  to  a  value  t,  and  its  value  is  then  mapped  on  to  a  uniform 
random  number  distribution  from  which  t  is  directly  selected. 

If  p(t)  is  the  probability  per  unit  time  that  an  electron  has  a  flight,  of  duration  time  t, 
terminated  by  some  scattering  process  (i.e.  has  a  drift  time  t  in  momentum  space  and  its  then 
scattered)  then  the  solution  of  equation  (III.  1-2)  in  the  term  form 

r  =  jp(f)df  (HI.  1-6) 

0 

will  enable  a  random  distribution  of  such  flight  time  to  be  generated. 

Now  suppose  that  the  electron  drifts  for  a  time  before  being  scattered  and  that  this  time 
consists  of  n  tiny  increments  6ti,  8t2,  ...  ,  6tn.  The  probability  of  the  electron  being  scattered,  within 
the  time  interval  5ti,  is  X(k)  6ti,  where  X.(k)  is  the  total  scattering  rate  defined  by  following  equation: 

=  XUlA-l) 

where 

X.(k)  =  js.(k,k')dk  (III.  1-8) 

the  probability  that  there  will  not  be  any  scattering  during  5ti  is  thus  (l->.(k)  St;).  Therefore,  the 
probability  of  an  electron  drifting  ia  momentum  space,  for  a  time  t,  is 

S(t)  =  n(l-X(k)8t,)  (in.  1-9) 

i-1 

so  that 

log{S(t)}  =  2log(l  -  Mk)5t, )  (m.  1-10) 

i-1 

However,  since  ^k)  St; «  1,  equation  (HI.  1-10)  reduces  to 

iog{s(t)}  =  -j;8t,x(k)  (m.1.11) 

i-1 

which  gives,  immediately 
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(in.  1-12) 


(m.i-13) 


S(t)  =  expj- j  A.(k)dt' j 

where  X,(k)  is  a  function  of  time  through  following  equation: 

The  probability  density  p(t)  is  therefore 

p(t)=  ?L(k)exp|-jMt)dt'|  (in.1-14) 

Using  equation(III.l-14)  and  a  uniformly  distribution  random  number  distribution  gives 


r  =  1  -  exp  -jMk)df 


(m.i-15) 


In  the  program,  we  include  acoustic  phonon  scattering,  polar  optic  phonon  scattering, 
intervalley  scattering,  impurity  scattering,  alloy  scattering  and  piezoelectric  scattering. 


IIL2  MC  results  for  some  materials 

In  this  section,  the  calculated  scattering  rates  for  some  materials  are  shown  that  are  used  in 
the  MC  simulation.  In  Fig.  III.2-1,  the  scattering  rate  due  to  impurity  is  shown  as  function  of  incident 
electron  energy. 

In  Fig. III. 2-2,  we  calculate  the  interface  scattering  rate  as  a  function  of  rough  length  L  in 
InGaAs/InAlAs  system.  The  Ino53Gao47As-Ino52AIo48As  modulation-doped  (MD)  heterostructures 
is  important  for  FET  applications.  Since  in  an  MD  structure  the  electron  motion  occurs  close  to  the 
interface,  it  is  important  to  develop  a  model  for  the  structural  quality  of  an  interface  and  use  it  to 
calculate  the  effects  of  interface  roughness.  Following  Hong’s  work*^,  the  scattering  rate  due  to 
interface  roughness  is  given  as: 

where  Ji  is  the  first-order  Bessel  function,  N«x  is  the  accumulation-layer  charge  density  and  is  the 
2-D  electron  carrier  density.  In  Fig.  111-2,  scattering  rate  for  InGaAs-InAJAs  interface  is  plotted  as  a 
function  of  L.  It  is  important  to  realize  that  as  L  increases  there  is  initially  a  sharp  increase  in  Winter 
(sharp  decrease  in  mobility  ),  but  beyond  Ls50  angstrom,  Wuhct  become  independent  of  L  and 
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eventually  begins  to  decrease  (  mobility  start  to  increase).  We  can  say  that  once  the  roughness 
increases  beyond  l/Kp,  the  electrons  do  not  “feel”  the  surface  roughness  as  much. 

nL3  Velocity-field  characteristics 

In  this  section,  simulated  velocity-electric  field  curve  for  a  few  confound  semiconductors  are 
shown.  The  transport  and  other  material  parameters  used  in  the  simulation  are  shown  in  Table- 1.  The 
simulation  results  are  obtained  after  100,000  interaction.  Some  of  the  computation  are  carried  out  on 
the  Air  Force  computer  system 


Table  1.  Parameters  for  III-V  semiconductor. 


Parameter 

GaAs 

GaN 

InAs 

InP 

GaSb 

GaP 

Density 

5.37 

6.1 

5.667 

4.81 

5.6137 

5.477 

Velocity  of  Sound 

5.22 

5 

m 

4.594 

5.847 

High  freq.  dielectric  constant 

10.82 

5.35 

12.37 

9.61 

14.44 

9.55 

Static  dielectric  constant 

12.53 

9.5 

15.15 

12.56 

15.69 

12.4 

Polar  optic  phonon  freq. 

5.37 

15.1 

4.54 

8.79 

8.79 

Equivalent  intervalley  phonon  freq. 

4.54 

4.54 

3.99 

5.63 

5.63 

Non-equivalent  intervalley  phonon  freq. 

4.54 

4.54 

3.99 

5.63 

5.63 

Acoustic  deformation  potential 

7 

12 

5.8 

m 

9 

Equivalent  intervalley  deformation  potential 

I.OOE+IO 

l.OOE+10 

l.OOE+09 

1.80E+09 

l.lOE+09 

Non-equivalent  intervalley  deform,  potential 

l.OOE+10 

l.OOE+10 

l.OOE+09 

9.00E+08 

6.00E+09 

Central  valley  effective  mass 

0.067 

0.13 

0.0239 

0.079 

0.22 

0.0925 

Satellite  valley  effective  mass 

0.35 

0.7 

0.72 

0.43 

1.1 

0.42 

Valley  separation 

0.36 

1.5 

1.138 

1.8 

1.466 

1-10 


Band  gap 

1.42 

3.38 

0.354 

1.344 

0.75 

2.78 

Lattice  constant 

5.56 

4.5 

6.0583 

5.8687 

6.09593 

5.45 

IIL4  Impact  Ionization  Process 
rn.4.1  Some  Previous  Model 

The  impact-ionization  ( I.I )  process  is  very  important  to  the  study  of  modem  semiconductor 
devices,  such  as  avalanche  photo  detectors,  in  which  gain  is  provided  by  carrier  multiplication.  In  an 
I.I  process  a  charge  carrier  with  high  kinetic  energy  (  ‘liot  carrier”  )  coUides  with  a  second  charge 
carrier,  transferring  its  kinetic  energy  level.  I.I  process  may  be  classified  as  band-band  process  or 
band  trap  process  depending  on  whether  the  second  carrier  is  initially  in  the  valence  band  and  makes 
a  transition  from  the  valence  band  to  the  conduction  band,  or  whether  it  is  initially  at  localized  level  ( 
trap,  donor,  acceptor  ),  and  makes  a  transition  to  a  band  state.  In  HEMTs  with  narrow  bandgap 
channel  (like  InAs,  InGaAs),  impact  ionization  can  affect  the  device  performance. 

Wolff^^  is  perhaps  the  first  to  put  forward  a  diffusion  model  for  impact  ionization  with  an 
argument  that  electrons  gain  energy  gradually  due  to  many  collisions.  Wolff’s  model  is  likely  to 
dominate  at  very  high  electric  fields  where  the  average  electron  energy  is  comparable  to  the 
ionization  threshold  energy.  Shockley’s  model'**,  often  referred  to  as  the  lucky  electron  model, 
presumes  that  only  those  few  electrons  lucky  enough  to  avoid  collision  gain  sufficient  energy  from 
field  for  the  ionization.  These  lucky  electrons  could  be  characterized  by  their  ballistic  movement. 
This  balhstic  movement  is  ejqjected  to  be  dominant  at  low  electric  field  v^ere  the  electron  energy  is 
very  low  compared  to  the  ionization  threshold  energy. 

The  most  widely  used  model,  due  to  Baraff^^  reduces  to  Shockley’s  model  in  the  low-field 
range  while  it  converges  to  Wolff’s  model  in  high-field  limit.  Recently,  Ridley*^’*^  has  extended 
Shockley’s  lucky  electron  model  by  making  a  distinction  between  the  rates  of  momentum  and  energy 
relaxation.  This  ‘lucky  drift”  model  is  intermediate  between  Shockley’s  ballistic  state  and  WolflPs 
equihbrium  state.  All  these  model  ,  however,  assume  that  the  mean  free  paths  for  scattering  are 
constant  and  independent  of  energy.  In  general,  the  mean  free  paths  do  depend  on  energy. 
Furthermore,  these  models  do  not  give  the  exact  form  of  the  energy  distribution  fimction. 
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Perhaps  the  first  attempt  for  energy  dependent  results  is  that  by  Keldysh^*  vdio  solved  the 
Boltzmann  transport  equation  on  the  basis  of  an  assumed  form  for  the  symmetric  part  of  the  energy 
distribution  function.  Chen  and  Tang  have  provided  an  explicit  expression  for  energy-dependent 
mean  fi-ee  path  that  is  required  for  the  use  of  Keldysh’s  results.  By  using  simple  band  structure  and 
asgiimp!  that  collision  of  impact  ionization  do  not  dependent  on  the  angle  between  initial  and  final 
state,  Keldysh  give  a  simple  formula  for  the  rate  of  impact  ionization; 


1 

Xa(E) 


(m.4-1) 


where  E*  is  a  threshold  energy  and  1/-^  )  is  the  electron- optic-phonon  scattering  rate  averaged 


over  all  electron  wave  vectors  corresponding  to  the  threshold  energy  E^,.  Finally,  P  is  a  coefficient 
which  we  consider  merely  a  fitting  parameter,  and  it  dependent  with  materials.  Keldysh’s  model  is 


often  referred  to  as  “hard  threshold”  model. 

Another  important  model  is  “random-k”  approximation  by  Kane’’.  In  Kane’s  model,  the 
energy-dependent  rate  for  inelastic  scattering  of  production  of  electron-hole  pairs  is  computed  by 
first-order  perturbation  theory  using  a  screened  Coulomb  interaction  with  a  fi-equency-  and 
momentum-  dependent  dielectric  function  in  the  random-phase  approximation.  The  threshold  for 
momentum-conserving  pair  creation  is  foimd  to  very  close  to  that  determined  by  energy  conversion 
alone.  Using  this  “random-k”  method,  the  scattering  rate  for  primary  holes  is  obtained  and  found  to 
be  almost  identical  with  that  for  primary  electrons  of  comparable  energy.  The  secondary-particle 
energy  distribution  function  are  also  determined  for  primary  holes  and  electrons.  One-electron  state 
-density  structure  is  prominent  in  these  distributions. 


nL4.2  Present  Work 

Our  work  is  partly  based  on  the  models  proposed  by  Kane  and  Kunikiyo  First  of  all,  we 
just  interest  at  the  electron  behaves  near  the  thrediold  energy,  so  we  assume  that  the  initial  state  of 
secondary  electron  is  near  the  top  of  valence  band,  and  we  do  not  care  about  the  hole’s  energy.  For 
simplify  the  calculation,  we  begin  with  sinq)le  band  structure. 
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The  impact  ionization  rate  is  calculated  by  the  use  of  Fermi’s  golden  rule.  In  the  mqiact- 
ionization  process,  and  electron  in  the  conduction  band  interacts  with  an  electron  in  the  valence  band 
via  the  screen  Coulomb  potential  V(r-r ): 


V(r-r)  = 


47ie(q,to) 


(m.4-2) 


If  state  1  (  a  Bloch  state  with  band  index  yi  and  wave  vector  ki  )  and  2  are  the  states  of  the  initial 
electron  in  the  conduction  band  and  valence  band  before  the  transition,  respectively,  and  1  and  2  are 
those  of  the  final  electrons  in  the  conduction  band  after  the  transition,  the  ionization  rate 
1/ti  i.(1,2->1  ,2  )  is  expressed  by: 


1 


27t 


t,,(1,2->1,2)  h 


M  '  +  M  '  +  M  -M  ' 

■  D  as 


*6(E, +Ej-E,,-E,.)  (in-4-3) 


where  6(E)  is  the  energy  conserving  delta  fimction  and  Ei(i-l,2,r,2’)  denote  the  energy  of  each 
electron.  The  duect  matrix  element  Ma  and  the  exchange  matrix  element  Mb  are  given  by: 


M.  =( 


M,  =  vp,.(r,)viy,(rj 


(ffl.4-3.1) 


(m.4-3.2) 


where  8(q,co)  is  a  dielectric  fimction  depending  on  both  wave  vector  and  transition  energy. 
|Ma|^+lMbl^  in  equation  (in.4-3)  denotes  the  matrix  element  in  the  case  that  the  spin  of  electron  1  and 
2  is  different,  while  jMa-Mbl^  denotes  the  matrix  element  in  the  case  that  spin  of  electron  1  and  2  is 
the  same.  The  wave  fimction  of  electron  vpvjc  is  expressed  by  the  Fourier  series  over  reciprocal  lattice 
vector  G 

V^O  <5 

where  kv  is  the  wave  vector  associated  with  energy  band  v.  The  wave  function  are  obtained  from  the 
empirical  local  pseudopotential  method.  If  we  represent  l/lrrr2l  by  the  Fourier  expansion,  then 

1  1  y  (m.4-5) 

k-lil  V,^q’ 

By  substituting  (HI. 4-4)  and  (in.4-5)  into  equation  (in.4-3)  and  (III.4-3.  l),(III.4-3.2),  we  get: 
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e(q,®)  V,  G,.Gr:s:..G, 

(G,)A.  (GJ- - - IT 

k,r.V  1^  k,r:V  +G^__k^_G,| 

*S(-k,  -  G,  +  k,  +  G,  -  k,,  -  G,.  +  k,  +  G, ) 

Sa;,.,.,(g,)a-„(g,) 

S(q,C))  Vq  G,,G2.G,.,G5. 

•A,„(G.)A^,.(G.)-— 


(ra.4-6) 


(in.4-7) 


*5(-kj,  -  G,,  +  kj  +  Gj  -  kj,  -  Gj.  +  kj  +  G^ ) 

where  5  is  the  wave  vector  conserving  delta  function.  Probability  of  transition  from  initial  state  1  is 
obtained  by  equation  (IQ. 4-3)  summing  over  k2,  kr,  k2’ ; 

Wu(l)=  XWna2-^r,2')  (1114-8) 

k2.ki’.kr 

The  rate  is  calculated  numerically  in  such  a  way  as  to  conserve  both  momentum  and  energy  of  carrier 
in  each  transition. 

Using  the  two-band  approximation^*,  we  calculate  the  matrix  element  Ma  and  Mb,  unlike 
aggiimptinn  that  the  matrix  element  is  constant  so  it  can  be  brought  out  of  the  sum  motion  in 
calculation  of  scattering  rate,  which  are  dependent  not  only  on  the  initial  and  final  energy,  but  also  on 
the  angle  between  the  wave  vector  of  those  two  states.  The  matrix  elements  can  be  written  as: 


1  (  1 

Mf=— - {(aj  4-bjCos0j +CjCos^0j)*(aj+b,cos0, +CiCos^0,)} 

‘  /I  .  o  \/  *  ^ 


4Vs  V„ 


(k,+G,-k.-G.)^ 


(m.4-9) 

jM.f  {(^r  +b,cose^  +c,cos"ej)*(a,.  +b,,cose,,  +c,cos^0,) 

(01.4-10) 

where  0i  (I=l,2,r,2’)  is  the  angle  between  different  states  1,  2,  1’  and  2’.  The  a,  b  and  c  are 
constants  which  dependent  on  the  state  energy  and  the  effective  mass  of  different  valley  in 
conduction  band  Substituting  (10.4-9)  and  (10.4-10)  into  equation  (10.4-3)  and  (10.4-8),  and 
change  the  sum  in  (10.4-8)  to  integral,  we  obtain: 
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(m.4-11) 


W(k„k„  k,.,k:.)  =  ^|Mf5(E,  +  E,  -  E,  -  E,) 


W(k,)  = 


2n 


JJj  <ik,dk,dk,.|Mf  6(E,  +  E,  -  E,  -  E^. )  (HI. 4- 12) 


Following  the  usual  definition,  the  impact  ionization  coeflScient  a  is  given  as  the  ratio  of  the  average 
probability  of  intact  ionization  to  the  average  of  electron  drift  velocity; 

OO  OO 

a  =  j  W,(e)Q(8)fo(s)d8  /  v  jQ(8)fo(8)d8  (m.4-13) 

0  0 

where  Wii(8)  is  the  intact  ionization  scattering  rate,  Q(8)  is  the  density  of  states,  and  the  vj  is  the 
electron  drift  velocity.  For  the  range  of  electric  field  values  relevant  to  the  impact  ionization  process, 
Vd  can  be  approximated  by  the  saturation  velocity  Vs. 


ni.4.3  Results 

In  this  section,  the  impact  ionization  rate  calculated  from  the  formal  principle  is  shown  in 
FigsIII.4. 1-5  for  a  few  compound  semiconductor. 

From  the  results  for  impact-ionization  rate  calculation,  we  observe  that  the  scattering  rate  is 
strongly  dependent  on  the  band  gap  of  different  materials.  When  the  band  gap  of  material  is  around 
1.0  eV,  the  scattering  rate  increases  with  the  of  the  initial  energy.  However,  it  is  quite  different  in 
the  narrow-bandgap  semiconductors,  such  as  InAs  and  InSb.  For  these  materials  the  scattering  rate 
increase  rapidly  near  the  threshold  energy,  and  then  decreases. 

In  Fig.  in. 4-6  the  velocity-electric  field  characteristics  for  GaAs  is  plotted  with  temperature 
as  a  parameter.  Both  with  and  without  impact  ionization  resuhs  are  shown.  In  Fig.in.4-7  and 
Fig.in.4-8,  the  the  velocity-electric  field  characteristics  for  Ino  j3Gao47As  and  InAs  are  shown.  For 
wide  bandgap  materials  (like  GaAs),  impact  ionization  does  not  modify  the  velocity-electric  field 
characteristics.  However,  with  decreasing  band  gap,  the  contribution  of  impact  ionization  in  the 
velocity-electric  field  characteristics  become  obvious  as  is  shown  in  In  Fig.III.4-7  and  Fig.III.4-8.  In 
narrow-bandgap  material,  the  velocity-electric  field  characteristics  can  be  divided  electric  field  in  the 
first  regions,  imder  low  electric  field,  the  electron  energy  is  lower  than  impact  ionization  threshold 
implying  that  intact  ionization  is  not  occurring  and  will  resuh  in  drift  velocity  comparable  for 
without  in:q)act  ionization.  In  the  second  region,  under  high  electric  field,  impact-ionization  scattering 
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becomes  dominant  scattering  mechanism,  and  the  drift  velocity  in  the  process  of  intact  ionization  is 
lower  than  without  1.1.  calculation  in  this  case,  electrons  loose  energy  during  the  impact  ionization 
process,  and  transfer  energy  and  momentum  to  the  secondary  electron-hole  pair.  In  the  third  re^on, 
rmder  very  high  electric  field,  some  electrons  gain  high  energy  from  electric  field  during  free  flight 
and  does  not  ‘Teel”  the  affect  of  electrons  in  valence  band,  therefore  irq)act  ionization  does  not  show 
affect  the  velocity-electric  field  characteristics.  Moreover,  with  increasing  temperature,  the  incident 
electron  energy  increase  making  intact  ionization  more  probable  as  is  shown  in  high  temperature 
plots  in  Fig.III.4-6. 

In  Fig.III.4-9,  the  impact  ionization  coefficients  is  plotted  as  a  fimction  of  electric  field  at 
different  ten^erature  for  Ino.53Gao.47As.  The  effect  of  temperature  is  similar  as  that  of  Fig.  111.4-6.  In 
Fig.in.4-10,Fig.in.4-ll  and  Fig.ni.4- 12,  low  field  mobility,  peak  velocity  and  saturation  velocity  are 
plotted  as  a  fimction  of  In  mole  fraction  for  InxGai,xAs  system  at  room  temperature.  In  the  inside, 
the  difference  between  the  results  with  and  without  intact  ionization  results  are  shown.  As 
observed,  impact  ionization  decreases  p,  Vp  and  Vs  with  increasing  In  mole  fraction.  Beyond  the  In 
fraction  of  80%,  the  difference  is  quiet  noticeable  and  will  affect  device  performance. 


IV  Device  characterization 
IV.l  DC  Analysis 

The  current- voltage  characteristic  is  evaluated  in  the  reduced  potential  scheme^'.  To  facilitate 
calculation  the  self-consistently  calculated  x,v  and  Ef  are  ejq)ressed  in  the  following  fimctional  forms: 

+  (IV.1-1) 


Ep  (n/x))  =  Epo+r  ln(/i,(x)) 

=  E,o+yEE,{n,{x))  itV) 

where  x  represents  the  position  in  the  channel  with  respect  to  the  source  at  x=0.  The  numerical 
values  of  a,  b,  Epo  and  y  are  obtained  by  fitting  the  x„  vs  ns  and  Ep  vs  n*  curves  obtained  by  solving 
Schroedinger  and  Poisson’s  equations.  The  2DEG  concentration  using  reduced  potentials  can  be 
written  as 


^  s\V^-LE,{x  =  Q)lq-V,\  e-\V,ls 

n  (x  =  0)  =  — 1 ^  =  ■ 


(IV.  1-3) 
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where  the  reduced  potential  at  the  source  end  of  the  device  is; 


F,-AE,(x  =  0)/g-F, 

Yt\ 

d- Ad{n,ix  =  Gi))  and  d  is  the  thickness  AlGoAsSb  layer,  A/(/i,(x  =  0))  denotes  the 

effective  channel  width  at  the  source  and  equals  ®))  •  should  be 

mentioned  that  A/(/2,(x))  is  a  function  of  n,  and  is  properly  updated  depending  on  w,(x)  (the 
channel  is  narrower  at  the  source  ( x  =  0 )  and  widens  as  one  move  towards  the  drain  region 
(x  =  Lg)).  Using  the  definition  of the  gate-source  voltage  may  be  written  as 


v^,  =  v,+s-\v,\^rM-^) 


(IV.1-5) 


where  the  threshold  voltage  =(/>,-  qlsiM^d/  jl  +  N,d,d,  +  N,d'^  / 2) .  ,  dj,  d,  and  Nj  are  the 

Schottky  barrier  height,  the  thickness  oiAlGoAsSb  donor  layer,  the  thickness  of  spacer  layer  and 
doping  concentration  oiAlGoAsSb  layer,  respectively. 

V^,  =  V^+s-\Vr\  +  r  hi(/i,(x  =  0))  (IV.  1-6) 

The  saturation  drain  current  Icjat  in  the  reduced  potential  scheme  can  be  written  as; 

- p ■  cos-'(^)])  (IV.  1-7) 

V..-V(x  =  L^)- AEp(x  =  I^)lq-VT  v.-e  v.-e 


^ c^at 


wdiere  p  = 


— 


Kl 


is  the  assumed 


k—f+s^ 

V  Po 

velocity-electric  field  characteristic.  The  magnitude  of  the  drain-sovuce  voltage  for  a  given  drain 
current  is  given  by 

+  -\f -'’■‘'fiiY]  (tV.l-8) 


q  G,  ^  "  I.  '  A/./, 

for  0  <  /,  <  7,^^ .  When  the  device  is  in  saturation,  /,  >  7,^ ,  the  drain-source  vohage  can  be 
written  as 


=  (V.-Vr  +  -(VC. -R. 


•ff 


^0^0 


-iri) 


Here  Ic  is  the  channel  current,  ,  P  =  2/3,  ,  Gq={Z-Z’vJ  d^g ),  Lg  and  Z  are  the  gate  length  and 


width,  respectively.  7<,  is  the  source  resistance  and  is  the  drain  resistance.  To  fit  the  experimental 
I-V  ciuves  in  the  saturation  region,  a  parasitic  resistance  Rp  is  introduced.  The  modified  drain-source 
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current  I(b=Ic+  VjJRp.  The  evaluation  of  the  dc  small  signal  parameters,  namely  (a)  transconductance 
gn,,  (b)  drain  resistance  rd  and  (c)  the  gate  capacitance  Cos  are  evaluated  by  using  the  method  detailed 
in  Ref^^ 

IV.2  Results  and  discussion 

An  Ino.53  Ga  0.47  As/Alo  e  Gao.4  AsSb  double  barrier  HEMT  with  a  QW  width  of  lOOA  is 
considered.  The  calculated  band  ofiset  is  0.984eV.  A  low  field  mobility  of  53,000  cmW-s  and  a 
saturation  velocity  of  3.2xl0’  cm/s  is  used  in  the  simulation’.  The  donor  doping  density  of  2x  10*® 
cm'®  with  a  doped  epilayer  thickness  of  350A  is  used. 

The  current-voltage  characteristics  are  shown  in  Fig.  IV.  1-1  wfiere  the  gate  bias  is  varied 
from  -1.0  V  to  OV  at  an  interval  of  0.5V.  An  inq)act  ionization  constant  of  5x10^  /cm  is  used  in  the 
computation.  The  I-V  curves  for  higher  gate  bias  clearly  shows  the  effect  of  intact  ionization. 

Fig.  IV.  1-2,  shows  the  calculated  transconductance  as  a  fimction  of  the  gate  bias.  A 
maYimnm  transconductance  of  800  mS/mA,  for  a  gate  length  of  1pm,  is  obtained  for  the 
rmoptimized  structure. 

1V.3  Small  signal  analysis 

HEMTs  using  InGaAs  as  the  channel  material  are  ideal  for  their  superior  transport  properties 
and  are  being  used  in  millimeter  wave  technology.  However,  the  narrow  band  gap  of  InGaAs  makes 
it  susceptible  to  impact  ionization.  Impact  ionization  in  HEMTs  gives  rise  to  kinks  in  the  ciurent- 
voltage  (I-V)  characteristics,  current  instability  and  hysteresis  along  with  high  gate  current.  Impact 
ionization  is  also  known  to  increase  device  noise®®.  Impact  ionization  is  initiated  at  the  drain  end  of 
the  gate  wfiere  a  large  electric  field  exists.  Kruppa  et.  aL®^  have  investigated  intact  ionization  in 
AlSb/InAs  HEMTs  and  have  modeled  the  inductive  behavior  of  S22  by  a  series  RL  circuit  at  the  drain 
end  of  the  small  signal  equivalent  circuit.  Reuter  et.  aL  suggested  a  voltage  controlled  current  somrce 
in  an  RC  environment  to  e?q)lain  the  inductive  S22  due  to  inq)act  ionization.  The  current  source  is 
controlled  by  the  voltage  across  the  gate-drain  region  namely  the  voltage  across  Cgd-  It  should  be 
mention  that  the  inductive  nature  of  S22  is  observed  only  at  low  frequencies  suggesting  that  inq)act 
ionization  may  be  dominant  only  at  the  low  frequency  end  of  the  ^ectrum  (upto  a  few  GHz).  This 
frequency  dependence  of  the  uipact  ionization  current  source  is  obtained  by  the  arrangement  of  the 
RC  network  in  an  otherwise  frequency  independent  gm'.  Thou^  the  inductive  nature  of  S22  indicates 
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the  presence  of  in5)act  ionization  an  exact  quantification  is  extremely  difficult.  Reuter  et.  al”  used  a 
genetic  optimization  technique  to  obtain  the  impact  ionization  induced  transconductance  gm'.  In  this 
paper,  we  report  a  technique  that  enables  one  to  determine  gm'  directly  from  experimental  data  and 
does  not  require  optimization^^. 

The  lattice  matched  InP  based  HEMT  structure  is  described  in  Ref  The  gate  length  and 
widths  are  0.15  fim  and  40  |un  re^ectively.  The  measurements  and  simulations  are  carried  out  at 
room  temperature.  The  small  signal  equivalent  circuit  reported  by  Webster  et.  al.^®  is  modified  by 
incorporating  the  impact  ionization  current  source  at  the  output  with  the  RC  branch  as  shown  in 
Fig.rV.2-1.  The  optimized  small  signal  equivalent  circuit  is  shown  for  bias  conditions  Vg=-0.3V, 
Ig=-6.4  |jA,  Vd=1.5V  and  lD=7.8mA.  The  circuit  is  optimized  to  the  experimental  S-parameters  over 
the  frequency  range  of  0.04GHz  to  40  GHz.  In  Fig.IV.2-2,  the  experimental  and  modeled  S- 
parameters  are  shown  and  the  agreement  over  the  frequency  range  is  excellent.  The  inductive  S22 
carries  the  signature  of  impact  ionization  as  modeled  by  the  voltage  dependen  t  current  source  at  the 
output  in  accordance  with  the  work  reported  by  Reuter  et.al.^^.  Though  an  optimization  of  the 
circuit  may  fit  the  measured  S-parameters,  does  not  provide  an  experimental  measure  of  gm'. 

In  Fig.IV.2-3,  the  experimental  Y21  is  shown  as  a  fimction  of  frequency,  for  Vd  =  1.5  V  with 
varying  gate  potential  An  interesting  point  to  note  is  the  sudden  increase  in  Y21  at  to— >0  or  near  dc. 
This  sudden  variation  can  be  explained  by  inspecting  Y2iat  dc.  For  the  intrinsic  circuit,  Y2i((o)  is 
expressed  as: 


X2I  ”  Sm  J 


(gm+j®C 

DC  X^+j^Ccs) 


-j^Cos- 


R 


GS 


R. 


,  1  1 

j®(^DC  ^Gs) D  D 


-)  (IV.2-1) 


^GS 


where  o  is  the  angular  frequency  and  the  rest  of  the  parameters  are  defined  in  the  small  signal 
equivalent  circuit  shown  in  Fig.IV.2-1.  Under  dc  condition  and  with  R<fa»Ri  the  above  expression 
reduces  to: 


Y2i(G>->0)  =  gm  +  gm'.  (IV.2-2) 

The  e?q)ression  strongly  suggests  that  the  sudden  change  in  Y2i(co)  as  co  approaches  0  is  due  to  gm  • 
This  provides  a  means  to  determine  gm'  purely  from  ejq)erimental  data  for  the  first  time.  In  Fig.rV.2- 
4,  the  extracted  gm'  is  shown  as  a  function  of  gate  and  drain  bias.  For  a  given  gate  bias  gm'  increases 
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monotonically  with  drain  bias  and  may  be  e}q)lained  by  accounting  for  the  increasing  gate-drain 
electric  field.  The  variation  of  gm'  with  gate  bias  for  a  given  drain  potential  peaks  at  a  certain  gate 
bias.  This  phenomena  is  due  to  the  dependence  of  gm'  upon  both  the  gate-drain  electric  field  and  the 
drain  current.  For  a  given  drain  bias  the  gate-drain  electric  field  Edg  increases  as  the  gate  bias 
becomes  more  negative.  Consequently  the  ionization  constant  increases.  However,  with  more 
negative  gate  bias  the  drain  current  decreases  and  eventually  at  some  negative  gate  bias  the  channel  is 
pinched  ofE  Therefore,  at  some  intermediate  gate  voltage  gm'  will  maximize  and  in  Fig.IV.2-4(b),  its 
dependence  is  shown  for  difierent  drain  bias. 

V  Noise 

Modeling  of  noise  is  based  upon  a  self-consistent  solution  of  Schroedinger  and  Poisson's 
equations.  The  analysis  of  noise  is  based  on  the  identification  of  the  difierent  noise  source  that 
present  in  the  channel,^  '^  namely  (a)  Johnson  Noise  in  the  ohmic  region,  (b)  noise  associated  with 
spontaneous  generation  of  the  dipole  layers  in  the  saturation  region,  (c)  gate  noise  due  to  elementary 
voltage  fluctuations  in  the  channel  and  (d)  induced  gate  noise  in  the  saturation  region.  Moreover, 
noise  due  to  impact  ionization  is  introduced  through  gm'.  The  noise  spectrum  of  the  in:q)act  ionization 
current  source  is  assumed  to  be  white  (in  Sec.  6  this  aspect  of  the  study  is  discussed). 

The  self- consistent  noise  model^’  is  extended  to  calculate  noise  properties  in  InP-based 
HEMTs.  By  accounting  for  the  noise  sources  and  matching  the  optimized  external  somce  in^edance 
to  the  transistor,  the  minimum  noise  figure  Fmin  and  minimum  noise  tercperature  Tmin  are  calculated 
as^’ : 

F..i,=l  +  2-g.(R,+jR:+i)  (V-1.1) 

V  8n 

T^=2  T  g.  (R.+R„,)  (K)  (V-1.2) 

where 

g.=«.(f)’(B-  +  P-2C>/ra:)  (V.1.3) 

Ij 

R.  =R.+Rg+R.  (V-1.4) 

where  rn  is  the  noise  resistance,  Rc  is  the  correlation  resistance,  Rs  and  Rg  are  the  source  and  drain 
reastances  and  R  =  LgA^,  Cgs,  is  the  gate  charging  resistance  and  Lg  represaits  the  length  of  the 
gate.  P,  R  and  C  represent  the  noise  coefficients  and  Rs,opt  is  the  optimal  external  source  resistance. 
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gn,  and  Cg,  are  the  device  transconductance  and  gate  capacitance,  respectively.  T  is  the  operating 
temperature  in  0  K. 

V.l  Results  and  Discussion 

An  Ino.53  Ga  0.47  As/Alo.e  Gao.4  AsSb  double  barrier  HEMT  with  a  QW  width  of  lOOA  is 
considered.  The  calculated  band  ofifeet  is  0.984eV.  A  low  field  mobility  of  53,000  cmW-s  and  a 
saturation  velocity  of  3.2xl0’  cm/s  is  used  in  the  simulation^®.  The  donor  doping  density  of  2x  10‘® 
cm‘^  with  a  doped  epilayer  thickness  of  350A  is  used. 

Fig.V-1,  shows  the  mmimiim  noise  figure  as  a  fimction  of  frequency.  The  gate  and  drain 
biases  are  1.5V  and  2.0V,  respectively.  As  observed,  the  minimum  noise  figure  is  less  than  IdB  at 
60GHz.  An  optimization  based  upon  (a)  QW  width  (b)  gate  length  and  (c)  doped  epilayer  thickness 
will  enable  the  realization  of  lower  F„i„  upto  lOOGHz.  Moreover,  the  role  of  gate  drain  separation 
on  noise  due  to  impact  ionization  is  under  investigation. 

VL  Frequency  Spectrum  of  Impact  Ionization 

In  order  to  investigate  the  frequency  dependence  of  the  inq)act  ionization  process  two 
different  studies  are  carried  out,  namely  (a)  time  dependent  solution  of  Boltzman  transport  equation 
using  Monte  Carlo  simulation  and  (b)  analytical  study  using  continuity  equation. 

VLl  MC  Simulation  of  Time  Dependent  Electrons  Distribution 

In  tbig  section,  some  results  of  MC  simulation  for  time  dependent  electrons  distribution  in  k- 
space  as  well  as  in  real  space  are  shown.  The  electric  field  is  appUed  along  the  z  direction.  The 
region  between  gate  and  drain  in  HEMTs  where  the  reverse  electric  field  is  extremely  high  and 
impact  ionization  could  happen  is  under  investigation. 

In  Fig. VI.  1-1,  electron  position  in  k-space  is  plotted  with  time  period  as  a  parameter.  In 
Fig.sVI.  1.2-3  the  distribution  of  electron  position  in  real  space  is  plotted  as  a  fimction  of  time  period 
as  well  as  the  distribution  of  drift  velocity.  With  increasing  time,  the  electrons  distribution  in  k-space 
and  in  real  space  become  broadened.  The  average  velocity  of  electrons  decrease  with  increasing 
time.  With  longer  timft  period  (correq)ond  to  low  frequency),  the  electrons  have  to  suffer  more 
scattering  events,  thus  their  energy  (vector  in  k-q)ace)  and  their  position  in  real  ^ace  become  more 
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random  and  consequently  velocity  is  reduced.  This  may  result  in  an  additional  phase  shift  in  the  drain 
current  with  the  magnitude  of  the  current  remaining  the  same. 
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Initial  State 
z=0.0,ei=^.026eV,t=0.0 


VL2  Frequency  Dependent  Drain  Current  Due  to  Impact  Ionization 

Intact  ionization  process  is  important  in  narrow  bandgap  channel  HEMTs  and  needs  to  be 
investigated. 

Following  the  approach  of  H.K  Gummel  and  J.L.Blue^®,  we  calculate  the  hole  current  (which 
equal  to  electron  cmrent  due  to  impact  ionization)  as  a  function  of  drain  current  as  well  as  frequency. 
Equal  ionization  coefficients  were  assumed  for  holes  and  electrons  and  sum  of  the  hole  and  electron 
currents  in  the  region  between  gate  and  drain  was  assumed  constant. 

From  the  continuity  equations  for  holes  and  electrons,  we  obtain: 

|■(p+Il)  +  |■(v^p-v.n)=2(g+r)  (VI.2-1) 

at  oz 

where  Vh,  Ve  is  velocity  of  holes  and  electrons  respect,  the  noise-less  carrier  generation  rate 
g  =  av^p  +  Pv^n  ,  and  y  is  a  stochastic  generation  rate  ( which  we  assume  it  is  equal  to  zero  ). 

By  using  saturation  velocity  approximation  which  is  reasonable  under  the  high  electric  field  can  make 
impact  ionization  happen  and  assume  p  is  neghgible,  we  get; 

I„,=qv.n+|:eE  (VI.2-2) 

at 

After  some  steps,  we  obtain  the  drain  current  is; 

-^(j©-2pv,  XZ“Zo)+j©t 

I(x,t)  =  I„e''‘  (VI.2-3) 

where  lo  is  the  current  inject  from  gate  end  which  we  assume  it  is  constant.  So  the  hole  current  in  the 
gate-drain  region  due  to  carrier  multiplication  can  be  write  as: 

1  V  — (jo_2pv,)Z+j«)t 

=  — Jl.e'"  d2-I,  (VL2-4) 

Hd  0 

So  we  can  obtain  frequency  dependent  hole  current  as: 


Ip(®) 


(cosot  +  j  sin  otX  A  -i-  JB) 


(VI.2-5) 


where 


CO  — uji- 

A  =  2P(1  -  cos — —  - e^'‘sm  — 

V.  V,  V 


(VL2-5.1) 
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(VI.2-5.2) 


B  =  ^(i_  )-  2Pe-^'‘^-  sin^ 

V  V  V 

^  c  c  ® 

From  Figs.  4-1  and  4-2,  we  see  that  there  is  a  significant  change  of  the  current  magnitude  above  Vc  ( 
around  lOOGHz ),  implying  that  I.I  should  be  taken  into  account  above  this  frequency. 


Vn  Quaternary  channels 

In^Gai_^Asi_ySbyh\SxcQ  matched  to  AlaGai_^Asj_f,Sbi,\&  investigated  as  a  possible 

candidate  for  channel  material  for  high  performance  HEMTs.  In  the  following  few  paragraphs  a 
justification  for  the  proposed  quaternary  channel  material  is  provided. 

o 

For  an  In  mole  fraction  of  0.3  and  a  lattice  constant  of  5.948  A  (lattice  matched  to  AlAsSb) 
AF.^  equals  0.9eV.  The  corresponding  Sb  mole  fraction  is  (y  =  5.6533  -  0.4051  x)/(0.4426  - 

0.0216  x)),  where  is  the  lattice  constant  of  the  quaternary)  is  0.4  yielding  a  band  gap  of  0.6  eV. 
The  corresponding  T  -L  separation  is  a  modest  0.6  eV.  It  should  be  noted  that  the  maximum  AE^ 
for  lattice  matched  InAs/AlAsSb  is  1.1  eV  with  the  InAs  band  gap  of  0.36  eV.  Therefore,  the  band 
gap  imprnve.s  from  0.36eV  to  0.6  eV  without  sacrificing  AE^  if  InAs  is  substituted  by 
In^Gaj_^Asj_ySby.  A  deep  QW  guarantees  electron  confinement  along  with  a  higher  2DEG 
concentration,  however,  a  decrease  of  AE^  from  1.1  eV  to  0.9  eV  does  not  radically  change  the 
2DEG  concentration  as  the  higher  sub-bands  are  sparsely  populated.  On  the  other  hand,  an  increase 
of  the  band  gap  from  0.36eV  to  0.6  eV  will  require  that  the  electron  gain  an  additional  0.36  eV 
(=1.5*(0.6-0.36))  of  energy,  from  the  appHed  field,  making  possible  a  great  reduction  in  inq)act 
ionization. 

The  room  temperature  low  field  mobihty  |i(x,y)  of  In^Go As ]_ySby  is  obtained  by  using  the 
following  relationship; 

^(x.y)  =  8.5xl0^+L95xl0'’x-3.5xl0%+5.55xl0'‘xy  (VH-l) 

The  relationship  is  obtained  by  using  the  interpolation  formulation  reported  by  Moon  et.  ai  [9]. 
Proper  study  of  low  field  mobility  should  be  earned  out  by  using  a  Monte  Carlo  simulation. 
However,  the  above  equation  provides  an  approximate  number  so  that  the  material  under 
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investigatiott  can  at  least  be  con:q)ared  to  In/jOt-tAs-  Eqn.  3,  for  Ino.53Gao.47As  provides  a  low  field 
mobility  value  of  L8x  10  cm^/F-s  in  accordance  wWi  the  e>q)erimental  value  of  L4xl0  cm /V-s 

^^4lereas  p  or  Ino.3Gao.7Aso.4Sbo.o  equals  2.22x10  cm /V-s,  better  than  the  low  field  mobihty  of 

Ino.53Gao.47As,  however,  much  less  than  that  oilnAs. 

Infia,_,^ASf_ySbj,  seems  to  be  a  good  candidate  for  channel  material  in  HEMTs  since  the 

band  gap  and  A£<. ,  of  the  quaternary  charmel  material,  is  large  enough  to  suppress  impact  ionization 
and  provide  a  high  2DEG  concentration  with  good  confinement.  Also,  the  mobility  is  higher  that  that 
of  lno.5^ao.47As.  Providing  an^  of  500  GHz  for  a  0. 1  pnt  gate. 

In  order  to  justify  the  use  of  the  above  enqrirical  relation^iip  for  low  field  mobility  the 
velocity-electric  field  for  quaternaries  are  determined.  Given  the  alloy  scattering  rate  of  the  binaries 
the  alloy  scattering  rate  P(x,y)  of  the  quaternary  AxBi.xCyDi.y,  is  suggested  to  be: 

P(x,  y )  =  (1  -  xjyPec  +  (1  -  xXl  -  y )Pbd  +  xy^Ac  +  x(l  -  y)P^  (Vn-2) 
wfiere  x  and  y  are  the  mole  fi:actions  in  InxGai.xASySbi.y  and  Pab  are  the  alloy  scattering  rate  of  the 
binaries.  In  Fig.  VII.  1,  the  alloy  scattering  rate  in  InxGai.xAsySbi.y  as  a  fimction  of  x  and  y  are  shown. 

In  Fig.Vn-2,  the  drift  velocity  is  plotted  as  a  fimction  of  electric  field  for  different  channel 
materials.  InGaAsSb  system  has  hi^er  low  field  mobility  and  saturation  velocity  than  InGaAs 
system  The  low  field  mobility  extracted  from  MC  calculation  agrees  quite  well  with  the  data 
obtained  from  the  empirical  formulation. 

The  use  of  a  quaternary  channel  can  be  made  more  attractive  if  the  following  scheme  is  used. 
In  alloys  of  InGaAsSb  alloy  scattering  will  be  one  of  the  dominant  process.  If  thin  multiple  layers  of 
InAs-GaSb  are  grown  till  the  same  channel  width  is  obtained  it  may  provide  a  better  velocity-electric 
field  characteristic.  In  the  proposed  growth,  alloy  scattering  is  replaced  by  interface  scattering. 
However,  as  there  are  less  interfeces  channel  transport  will  hxprove.  In  Fig.  Vn-3,  the  velocity- 
electric  field  characteristics  for  alloyed  and  siq)erlattice  (SL)  channel  are  corrq)ared.  The  SL  chaimel 
has  improved  low  field  mobility  and  peak  velocity. 
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VUL  Conclusion 

A  complete  model  that  includes  (a)  quantum  well  calculations  (b)  transport  and  (c)  noise  for 
deep  quantum  well  HEMT  is  presented.  Tune  independent  Bohzman  transport  equation  (BTE)  is 
solved  using  ensemble  Monte  Carlo  to  study  the  velocity-electnc  field  characteristic  in  InxGai-xAs. 
Impact  ionization  in  narrow  band  gap  channel  gives  rise  to  a  degradation  in  the  low  field  mobility  and 
peak  velocity.  The  solution  of  the  time  dependent  BTE  indicates  that  inq)act  ionization  process  is 
frequency  dependent  and  this  dependence  is  also  a  fimction  of  drain  current  and  source  drain  spacing. 
InxGai.xAsySbi.y  provides  excellent  transport  properties  and  may  be  used  as  a  channel  material  in 
HEMTs.  The  usefiilness  of  the  quaternary  channel  can  be  fiuther  enhanced  if  a  superlattice  channel  is 
used  instead  of  an  alloyed  channel 
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DC  Figures 


Fig.II-1  Fermi  level  and  X„  as  a  function  of  2DEG  concentration  at  t=300K 
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Fig.III.2-1  Impurity  scattering  rate  calculation  for  GaAs 
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Drift  velocity  (  10  '  cm/s  )  Scatterlno  rate  ( • 


Fig.in.2-3  The  drift  velocity  vs.  electric  field  in  KGa  i-xAs  as  a  function  of  In  mole  fraction 
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Drift  velocity  (  x10  '  cm/sec  ) 
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Fig.III.2-4  The  drift  velocity  vs.  electric  field  in  AlxGa  i-xAs  as  a  function  of  In  mole  fraction 


Fig.lII.2-5  The  drift  velocity  vs.  electric  field  in  Ino.53Gao.47As  at  various  temperatures 
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Fig. III. 2-6  The  drift  velocity  vs.  electric  field  in  InAs  at  various  temperatures 
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Fig.  III.4-1.  Impact  ionization  scattering  rate  calculation  for  InP 
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Fig.III.4-7  Drift  velocity  vs.  electric  field  with  and  without  impact  ionization  for  Ino,53Gao.47As 
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Fig.  III.4-9  Impact  ionization  coefBcient  vs.  electric  field  at  different  temperatures  for  Ino.33Gao,47  As  at 

different  temperature. 


Fig.III.4-10  The  low-field  mobility  in  InxGai.,As  as  a  function  of  In  mole  fraction  at  room  temperature 
(and  the  difference  for  p  between  with  and  without  impact  ionization  scattering). 
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111.4-12  The  saturation  velocity  in  In^Gai-xA :  as  a  function  of  In  mole  fraction  at  room  temperature 
(and  the  difference  for  V,  between  with  and  without  impact  ionization  scattering). 
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Fig.  111.4-1 1  The  peak  velocity  in  InxCai.^As  as  a  function  of  In  mole  fraction  at  room  temperature  (and 
the  difference  for  Vp  between  with  and  without  impact  ionization  scattering). 
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Fig.IV.2-1  Measured  and  modeled  S-parameter  of  a  0.15pmx40nm  Ino.52Alo.48As/Ino.53Gao.47  As  HEMT. 
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Fig.IV.2-2  Optimized  small  signal  equivalent  circuit 
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Fig.IV.2-3  Measured  Y21  of  a  0.15nmx40^m  Ino.52Alo  ,i*As/Ino.53Gao  .nAs  HEMT  as  a  function  of  source-drain  bias  for  a 

gate  bias  of  -0.2  V. 


(a)  (b) 

Fig.  IV.2-4  Transconductance  due  to  impact  ionization  in  Ino.52Alo.48As/Ino.53Gao.47As  HEMT 
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INVESTIGATING  THE  ALGORITMIC  NATURE  OF  THE  PROOF  STRUCTURE 
OF  ORA  LARCHAHDL  TO  IMPROVE  ITS  PERFORMANCE 


Ahmed  E.  Barbour 
Professor  of  Computer  Science 
Mathematics  and  Computer  Science  Department 
Georgia  Southern  University 

Abstract 

The  algorithmic  nature  of  the  proof  structure  of  LarchA^HDL  theorem  prover  developed  by  Odyssey 
Research  Associates  (ORA)  is  investigated.  General  rules  have  been  established  to  prove  class  of  logic  circuits 
with  and  without  delay.  The  complexity  of  the  prove  for  others  classes  are  also  investigated  and  suggestions  to 
improve  them  are  discussed.  Several  other  issues  related  to  improving  the  performance  of  the  theorem  prover 
and  training  scientists  and  engineers  to  understand  the  basic  principles  of  the  formal  verification  and  the  proof 
stnx:ture  of  the  theorem  prover  as  well  as  how  to  use  LarchyVHDL  theorem  {rover  are  also  discussed. 
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INVESTIGATING  THE  ALGORITMIC  NATURE  OF  THE  PROOF  STRUCTURE 
OF  ORA  LARCHA^HDL  TO  IMPROVE  ITS  PERFORMANCE 

Ahmed  E.  Barbour 


1.  Introduction 

Hardware  Description  Language  (VHDL)  was  established  as  an  EEEE  standard  for  the  design  and  docu¬ 
mentation  of  digital  electronic  systems.  It  was  developed  in  response  to  the  Computer  Aided  Design  (CAD) 
community’s  need  to  handle  larger  and  more  complex  designs  and  the  need  to  be  able  to  electronically  exchange 
design  information  [1-4],  Simulation  and  formal  verification  are  emerged  as  the  only  reasonable  alternative 
tools  which  can  be  used  effectively  to  verify  complex  systems.  Simulation  techniques  use  exhaustive  testing  of 
the  VHDL  model  on  several  levels  to  determine  the  correctness  of  a  design.  Formal  verification  of  hardware  is 
a  mathematical  proof  which  shows  that  the  design  of  a  digital  circuit  satisfies  certain  prc^rties  regardless  of  the 
values  of  the  inputs.  Formal  verification  tools  can  also  be  used  to  prove  that  hardware  designs  satisfy  pn-operties 
such  as  functional  correctness,  security,  and  timing  ctHrectness. 

Odyssey  Research  Associate  (ORA)  had  devel(^)ed  an  Ada  verification  envircmment,  known  as  Penelope 
which  also  used  in  the  development  of  LarchATlDL  theorem  prover  [5-8].  Penelope  is  based  upon  the  Larch 
two-tiered  specification  language  devel(^)ed  at  the  Massachusetts  Institute  of  Techncdogy  (MIT)  [10-13].  The 
first  tier,  the  Larch  Shared  Language  (LSL),  is  a  first  order  predicate  calculus  used  to  build  the  traits,  or  theories, 
that  define  the  sorts  used  by  the  target  language  (in  the  case  of  VHDL,  types,  such  as  bit,  word,  string,  arrays, 
integer,  ..  etc.).  The  seccHid  tier,  called  the  Interface  Language,  defines  the  communication  mechanisms  of  the 
taiget  language,  m  this  case  VHDL,  in  the  Larch  notation.  LSL  is  used  to  mathematically  model  data  objects 
and  t^jerations  on  those  objects,  while  the  interface  language  maps  the  VHDL  model  into  the  abstractions 
represented  by  the  Larch  expressions  for  the  purpose  of  formal  reastming.  LarclVVHDL  verification  environment 
is  an  interactive  tool  that  helps  its  user  to  develop  and  verify  digital  electronic  hardware  designs  written  in 
VHDL  [5-9],  The  designer  develops  a  VHDL  model  from  a  specification  in  a  way  that  ensures  the  VHDL  model 
win  meet  the  specificaticxi.  Figure  1  shows  the  structure  of  LarchATIDL  formal  proof  methodology. 
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This  report  investigates  the  algorithmic  nature  of  the  proof  structure  of  LarchATlDL  theorem  prover.  Section  2 
presents  the  general  techniques  used  by  LarchATIDL  to  establish  the  proof  of  any  logic  structure  expressed  in 
VHDL.  To  understand  the  specification  written  fa  VHDL  entities  and  the  verificatioi  conditions  that 
LarchAnDL  generates.  Section  3  highlights  some  of  the  basic  ctmcepts  of  VHDL  semantics  in  this  area.  Sec¬ 
tion  4  investigates  the  methodology  used  to  establish  the  proof  of  different  logic  functions.  General  rules  have 
been  discovered  and  formulated  fa  certain  classes  of  logic  functions  in  Section  S.  Section  6  discusses  some 
special  cases  in  which  the  general  rules  are  not  applied.  The  impotance  of  introducing  formal  methods  into  the 
undergraduate  curriculum  is  presented  in  Section  7.  Section  8  coicludes  the  outcomes  of  this  study  and 
highlights  the  direction  fa  further  research  waks  to  be  done  in  this  field. 


Figure  1;  Formal  Verification  Using  LarchAUDL  Technique 

2.  Techniques  Used  to  Establish  the  Proof 

Larch/VHDL  verification  system  is  a  collectioi  of  tools  integrated  under  the  control  oi  a  graphical  user 
interface  called  gui.  The  canavas  of  the  gui  depicts  objects  and  relations.  LarclVVHDL  supports  a  window 
interface  envirounent  with  several  advanced  features  fa  entering  specificatiois,  developing  code,  and  providing 
access  to  the  Penelope  theorem  prover.  The  user  creates  an  entity,  writes  its  VHDL  entity  specification  and 
writes  its  Larch/VHDL  specification  in  the  fom  of  a  list  of  guarantees.  Then  a  command  is  invdced  to  install 
the  Larch/VHDL  specificaticm  in  the  library.  An  entity  specification  assigns  to  a  VHDL  entity  three  optional 
parts: 
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(1) 


a  set  of  imported  theories. 


(2)  a  name  and  sort  for  its  internal  state,  and 

(3)  a  list  of  guarantees,  where  a  guarantee  is  a  Boolean  term  in  the  language  introduced  by  the  imported 
theories  [18]. 

Any  syntax  and  sort-checking  errors  in  the  specification  ate  reported  by  the  system.  Then  the  user  creates  a 
VHDL  architecture  for  the  entity  and  inv(^ces  the  Verificatioa  Condition  (VC)  generator.  It  creates  Latch 
theory  that  declare  constants  for  all  the  ports,  signals,  and  generics  and  asserts  the  constrains  implied  by  the  sub- 
types  of  these  VHDL  objects.  The  VC-theory  COTtains  an  obligation  to  be  proved  of  the  form; 

Given  A,  Prove  That  A  Implies  B  (  A  -->  B), 

where  A  are  the  guarantees  of  the  processes  in  the  architecture  body,  and  B  are  the  guarantees  from  the 
specification  of  the  entity  being  verified  [8].  Once  the  system  has  generated  the  VC,  the  user  creates  a  proof 
object  and  invokes  the  prover.  While  proving  the  VCs,  the  user  can  easily  identify  from  the  signal  names  which 
line  of  the  VHDL  architecture  each  hypothesis  come  from. 


Figure  2:  LarchAUDL  Proof  Structure 

If  the  user  is  unable  to  complete  the  proof  because  a  particular  hypothesis  is  not  strong  enough  then  the  user 
knows  which  line  of  the  VHDL  is  suspect  If  all  the  proof  obiigadoos  can  be  proved,  then  the  given  architec¬ 
ture  satisfies  the  specification.  All  these  proof  steps  will  be  illustrated  by  examples  in  Section  4.  Figure  2  shows 


2-5 


the  general  relationship  among  entity,  architecture,  specifications,  and  theoies  used  to  help  the  p-oof  construct. 

The  LarchA^HDL  environment  includes  a  large  body  of  traits  that  define  the  basic  constructs  of  digital 
design  such  as  bit,  vector,  gate,  logic  operations  and  so  on.  Traits  define  sorts  Gogical  types)  and  state  proper¬ 
ties  or  assertions  that  must  hold  true.  Traits  also  contain  theorems  which  are  statements  that  are  deducible  from 
assertions,  previously  deduced  theorems,  and/or  the  assertions  or  theorems  of  other  traits  that  are  included.  The 
two-tiered  Larch  approach  allows  designers  the  capability  to  extend  the  library  of  traits  in  wder  to  support  user 
defined  sorts  in  their  models.  Once  implemented,  the  traits  are  available  as  library  components  for  reuse  in 
other  applications.  Traits  are  used  to  capture  the  concepts  and  relationships  used  in  distal  design.  There  are 
traits  devoted  to  arithmetic  concepts,  and  to  data  structures  such  as  anays  and  lists.  To  support  VHDL  seman¬ 
tics  there  are  traits  defining  signals,  and  signal  delay,  and  other  oxicepts  needed  to  express  the  semantics  of 
VHDL.  Ihere  are  traits  that  describe  the  relationship  between  bit  level  pieraticns  and  their  arithmetic  interpre- 
taticm,  in  2’s  complement  or  unsigned  bit-level  representations. 

Penelope  Larch/VHDL  theorem  prover  includes  a  simple  proof  editw/checker  for  predicate  calculus  that 
provides  a  munber  of  proof  rules  for  performing  simplificaticm  and  proofs.  Penelope  ^plies  the  rules  according 
to  user  directions  and  indicates  to  the  user  what,  if  anything,  sdll  has  to  be  proved  after  each  step.  Each  state¬ 
ment  to  be  proved  ot  simplified  is  presented  in  the  fram  of  a  sequent,  a  set  of  hypotheses  and  a  ctmclusicHi. 
Each  proof  in  Penelope  takes  place  in  the  context  of  an  available  theory.  Within  a  VHDL  design  unit,  the  theory 
is  determined  by  entity  declaration  annotations  and  all  the  local  lemmas  currently  being  applied  to  complete  the 
proof.  The  theory  that  is  available  f<x  proving  a  given  lemma  consists  of  the  axioms,  assumptions,  and  proved 
lemmas  that  precede  the  given  lemma. 

3.  Some  of  The  Basic  Concepts  of  VHDL  Semantics 

The  basic  ccmcepts  of  the  semantics  of  the  VHDL  as  described  in  the  Language  Reference  Manual  OLRM) 
and  how  they  are  modeled  in  Larch/VHDL  are  very  important  to  understand  the  specifications  of  the  VHDL 
entities  and  the  VCs  generated  by  the  system.  The  semantics  of  VHDL  describe  what  a  VHDL  simulate  must 
do.  Briefly,  a  simulates:  will  start  by  elaborating  the  VHDL  code  to  create  processes,  signals  and  their  drivers 
and  initializing  them.  Then,  the  simulator  proceeds  by  executing  simulatitm  cycles  consecutively,  updating  the 
processes  signals  and  drivers  and  advancing  time  at  each  cycle  [18].  In  wder  to  write  specification,  terminology 
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(4)  proof-structuring  rules,  such  as  proof  by  cases  or  proof  by  induction. 

The  following  examples  illustrates  the  components  of  the  proof  structure  of  a  VHDL  unit  and  the  commands 
used  to  perform  the  required  proof. 


5.1  Simple  Logic  Gate  Example:  Two-Input  AND  Gate 

Figure  4  shows  the  two-input  AND  gate  and  its  proof  structure  as  created  by  LarchA^HDL. 


Figure  4:  Two-Input  AND  Gate  and  Its  Proof  Structure 


This  example  illustrates  the  proof  methodology  for  a  simple  logic  gate.  All  other  logic  gates  follow  the  same 
prcxrf  technique  established  in  this  example. 

(1)  The  proof  structure  for  any  lo^c  gate  is  generated  by  creating  a  VHDL  entity  to  the  AND  gate  as  shown. 


entity  and2  is 

port  (  X,  y  :  Bit;  2  :  out  Bit); 
end  and2; 


Invoking  the  command:  Make  Lib  from  the  entity  iccoi  will  check  the  correctness  of  the  VHDL  syntax 
and  if  it  is  correct,  will  create  a  library  few*  the  AND  gate. 

(2)  The  second  step  is  to  create  the  arch  icon  from  the  entity  icon  as  shown. 


architecture  arch  of  and2  is 
begin 

2  <=  X  and  y; 
end  arch; 
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(3)  The  third  step  is  to  express  specificadons  in  a  specification  language  provided  by  LarchATIDL  system.  It 
provides  a  specification  language  for  asserting  specifications  of  functitmality  and  timing.  The  created 
specifications  from  the  entity  icon  is  shown. 

entity  and2 

guarantees 

always 

(  z  =  T’ )  =  ( (x  and  y)  T’ ) 
with  z  delayed  by  0  from  x,  y 
end 

Invoking  the  oxnmand:  Make  Lib  from  the  specification  icon  will  check  the  syntax  correctness  of  the 
specificadcHi  and  if  no  errOT  is  generated,  it  will  create  a  specification  Library. 

(4)  The  last  step  is  to  create  the  proof  icon  and  invcrfces  the  Verification  Condition  (VQ  generator  from  the 
architecture  icoa  by  using  the  command:  Generate  Verification  which  generates  the  VCs  required  to  be 
proved  by  the  Penelope.  If  there  is  no  error  generated  by  all  these  three  steps,  the  prover  may  be  invoked 
frOTn  the  proof  icon.  The  proof  screen  will  q)en  asking  to  add  the  VCs  to  be  proved.  If  you  click  on  the 
command-  Insert  Obligation,  the  VCs  will  be  inserted  in  its  positicm  on  the  screen  as  shown  below. 


— >  trait  arch_and2_VC  has  unfinished  proofs 
-I  Larch 


proof  sectira  for  arch_and2_VC 
--I  proof  secticHi 

— >  trait  arch_and2_VC  has  unfinished  proofs 
-I  vc  in  trait  aich_and2_VC 
-4  fyoof: 

»  (Forall  a: 

State:: 

y’last_event  @  a  >  0  and  x’lastjevent  @  a  >  0 
->  z  @  a  =  (x  @  a  and  y  @  a) 

and  z’last_event  @  a  >=  min(y’last_event  @  a,  x’lasLevent  @  a)) 
->  (Fcyall  a: 

State:: 

x’last_event  @  a  >  0  and  y’last_event  @  a  >  0 
->  (z  @  a  =  T’)  =  ((x  @  a  and  y  @  a)  =  T’) 
and  z’last_event  @  a  >=  niin(x’last_event  @  a,  y'lasLevent  @  a)) 
<proof> 

-4  end  proof  section 
—1  end  Larch 


It  is  clear  that  the  VHDL  code  of  two-input  AND  gate  shown  in  the  architecture  part  is  converted  to  Larch  state¬ 
ments  in  the  form: 
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»  Forall  a  (dme).  if  y  is  stable  and  x  Is  suble  for  a  >  0. 

->  it  implies  that  z  at  a  =  x  at  a  and  y  at  a 

and  z  will  have  been  stable  at  least  since  the  most  recent  change  in  x  and  y. 

Also,  the  specification  code  of  the  AND  gate  is  converted  to  Larch  statements  in  the  form  similar  to  the  archi¬ 
tecture  fwm  shown  previously.  The  proof  may  be  constructed  using  the  simple  logical  argument: 

Given  A,  Prove  that  A  Implies  B. 

The  Perelope  environment  provides  an  opportunity  to  record  and  check  the  designer’s  reasoning.  In  almost  all 
cases,  the  proof  problem  presented  is  partirmed  and  transfOTtned  using  previously  proven  theorems.  The  step- 
by-step  proof  of  two-input  AND  gate  is  as  shown. 

(1)  The  first  step  is  to  strip  off  the  quantifier  and  make  the  precondition  into  hypothesis  and  do  simplification 
by  lining  the  command:  By  synthesis/analysis  of  FORALL/IMPLIES  as  shown. 


--1  proof  sectitm 

— >  trait  arch_and2_VC  has  unfinished  proofs 
-I  VC  in  trait  arch_and2_VC 
--1  proof: 

BY  synthesis/analysis  of  FORALL/IMPLIES 

1.  y’last_event  @  a  >  0  and  x’last_event  @  a  >  0 
->  z  @  a  =  (x  @  a  and  y  @  a) 

and  z’last_event  @  a  >=  tnin(y’Iast_event  @  a.  x’last_event  @  a) 

2.  Fwail  a: 

State:: 

y’last_event  @  a  >  0  and  x’last_event  @  a  >  0 
->  z  @  a  =  (x  @  a  and  y  @  a) 

and  z’last_event  @  a  >=  min(y’last_event  @  a.  x’last_event  @  a) 

3.  x’lastjevent  @  a  >  0 

4.  y’lastjevent  @  a  >  0 

»  (z  @  a  =  ’1’)  =  ((x  @  a  and  y  @  a)  =  ’!’) 
and  z’last_event  @  a  >=  min(x’last_event  @  a.  y’last_event  @  a) 
<proo£> 

--1  end  proof  secticm 
"I  end  liuch 


The  cmnmand:  synthesis-conclusion,  is  first  selected  and  then  the  second  selection  is  the  command: 
Forall/implies  synthesis,  which  is  very  often  used  as  the  first  step  in  a  poof.  If  the  goal  is  of  the  form 
Forall  alpha  A  ->  B, 

the  forall  quantifier  is  peeled  off  and  the  implies  antecedent.  A.  is  made  a  hypothesis,  (alpha  is  the  time 
variable  which  considers  as  a  free  variable  (a).)  If  we  add:  with  analysis,  as  an  option  for  the  previously 
selected  fwall  synthesis,  it  invokes  forall  analysis  and  applies  it  on  the  result  As  shown  from  the  exam¬ 
ple,  the  for  all  quantifier  has  reduced  to  simple  four  logical  imply  statements  which  need  to  be  proven. 
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(2)  The  second  step  is  to  ^ply  the  command:  simplify  and  select  the  optitm:  SDVS  simplification  as  shown. 


-I  proof  section 

— >  trait  aich_and2_VC  has  unfinished  proofs 
-I  VC  in  trait  aich_and2_VC 
--I  isoof: 

BY  synthesis/analysis  of  FORALL/IMPLIES 
BY  simplificaticm 

1.  0  <  y’last_event  @  a 

2.  0  <  x’last_event  @  a 

3.  z  @  a  =  (x  @  a  and  y  @  a) 

4.  min(y’last_event  @  a,  x’last_event  @  a)  <=  z’last_event  @  a 

5.  Forall  a: 

State:: 

0  <  y’last_event  @  a  and  0  <  x’last_event  @  a 
->  z  @  a  =  (x  @  a  and  y  @  a) 

and  min(y’Iast_event  @  a.  x’last_event  @  a)  <=  z’last_event  @  a 
»  min(x’last_event  @  a,  y’last_event  @  a)  «:=  z’last_event  @  a 
<proo£> 

--I  end  proof  sectitm 
--I  end  Larch 

The  rnmmgTvt-  simplify  has  several  kinds  of  simplification.  In  s«ne  cases  a  series  of  simplifications  is 
useful.  One  powerful  simplification  tool  is  SDVS  simplify.  It  is  a  decisi(m  procedure  which  can  reduce 
valid  formulas  about  the  integers  with  "+".  to  true.  That  means,  it  is  effectively  applied  to  sim¬ 

ple  arithmetic,  inequalities,  predicate-ffee  expressions.  It  is  very  useful  near  the  end  of  a  proof.  It  is  clear 
that  tins  type  of  simplification  is  not  powerful  enough  to  omplete  the  proof. 

(3)  The  final  step  is  to  use  the  ccanmand:  add  +  which  applies  rewriting  rule  to  our  simplificaticm.  So  when 
the  step  By  simplification+  occurs,  the  goal  is  simplified  by  using  all  the  equations  marked  morphism  as 
left  to  right  rewrite  rules.  Fot  example,  if  n,  m,  k  are  variables  of  s«t  Nat,  the  goal 


(n  <=  m  and  m  <=  k)  ->  (n  <=  k)  would  rewrite  to: 

(int(n)  <=  int(m)  and  int(m)  <=  int(k))  -->  (int(n)  <=  int(k)) 


and  this  would  simplify  to  true.  This  procedure  is  shown  below. 


proof  sectitm  fcff  arch_and2_VC 
--I  proof  section 
-I  VC  in  trait  arch_and2_VC 
“i  proof: 

BY  synthesis/analysis  of  FORALLyiMPLIES 
BY  simplification-!- 
BY  synthesis  of  TRUE 
--I  end  proof  section 
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-I  end  Larch 


This  last  step  jyoves  the  VCs  by  issuing  the  final  command:  By  synthesis  of  TRUE. 

It  is  clear  that  this  proof  methodology  is  short  and  very  powerful.  The  proof  could  be  lengthy  if  we  select 
another  methodology.  For  example,  using  the  following  ccmimands; 

(1)  synthesis-conclusion  followed  by  Forall/implies  synthesis  will  produce  similar  result  as  shown  in  step 
(me. 

(2)  If  we  use  the  command:  synthesis<onclusion  again  and  then  selecting  the  command:  and  synthesis,  the 
result  of  the  prcxrf  simplification  will  be  as  shown. 


--1  proof: 

BY  synthesis  of  FORALL/IMPLIES 
BY  synthesis  of  AND 

1.  a: 

State:: 

y’last_event  @  a  >  0  and  x’last_event  @  a  >  0 
->  z  @  a  =  (x  @  a  and  y  @  a) 

and  z’last_event  @  a  >=  min(y’last_event  @  a,  x’last_event  @  a) 

2.  x’last_event  @  a  >  0 

3.  y’last_event  @  a  >  0 

»  (z  @  a  =  T ’)  =  ((x  @  a  and  y  @  a)  =  T’) 

<proof> 

1.  Forall  a: 

State:: 

y’last_event  @  a  >  0  and  x’last_event  @  a  >  0 
->  z  @  a  =  (x  @  a  and  y  @  a) 

and  z’last_event  @  a  >=  min(y’last_event  @  a.  x’last_event  @  a) 

2.  x’last_event  @  a  >  0 

3.  y’last_event  @  a  >  0 

»  z’last_event  @  a  >=  min(x’last_event  @  a,  y’last_event  @  a) 
<proof> 

--I  end  proof  secticm 
-I  end  Larch 


The  command:  and-synthesis  is  used  when  the  goal  has  the  form 
»  A  and  B. 

This  step  splits  the  proof  into  two  subproof,  one  with  goal  A,  and  the  other  with  goal  B.  If  the  (injunc¬ 
tion  is  larger,  then  mcae  subproofs  are  created.  Now  we  have  two  subproofs  which  need  to  be  proved. 

(3)  The  next  step  is  to  prove  the  first  subproof  as  shown. 


"1  proof: 

BY  synthesis  of  FORALL/IMPLIES 
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BY  synthesis  of  AND 
BY  analysis  of  FORALL, 

Wrni  a  FOR  a:State 

1.  Forall  a: 

State:; 

y’last_event  @  a  >  0  and  x’last_event  @  a  >  0 
->  z  @  a  =  (x  @  a  and  y  @  a) 

and  z’last_event  @  a  >=  min(y’Iast_event  @  a.  x’last_event  @  a) 

2.  x’last_event  @  a  >  0 

3.  y’last_event  @  a  >  0 

4.  y’last_event  @  a  >  0  and  x’last_event  @  a  >  0 
->  z  @  a  =  (x  @  a  and  y  @  a) 

and  z’last_event  @  a  >=  min(y’last_event  @  a,  x’last_event  @  a) 
»  (z  @  a  =  ’!’)  =  ((x  @  a  and  y  @  a)  =  ’1’) 

<proof> 

1.  Forall  a: 

State;: 

y’last_event  @  a  >  0  and  x’last_event  @  a  >  0 
->  z  @  a  =  (x  @  a  and  y  @  a) 

and  z’last_event  @  a  >=  min(y’last_event  @  a,  x’last_event  @  a) 

2.  x’last_event  @  a  >  0 

3.  y’last_event  @  a  >  0 

»  z’last_event  @  a  >=  niin(x’last_event  @  a,  y’last_event  @  a) 
<proof> 

~1  end  proof  secticm 
--I  end  Larch 


Using  the  command:  analyze-hypothesis,  it  will  open  to  you  the  sub-menu  buttons  which  contain  several 
fcwms  of  the  selected  hypothesis.  These  hypothesis  options  are: 

and-analysis,  equals-analysis.  exists-analysis,  forall-analysis, 
if-else-analysis,  if-then-analysis,  imp-analysis,  not-equals-analysis, 
not-analysis,  CM'-analysis,  xor-analysis,  and-analysis,  ...etc. 

Our  selection  will  be  imp-analysis  which  will  simplify  the  first  subproof  as  shown  above. 

(4)  The  next  step  is  to  use  the  command:  thinning  which  is  used  to  thin  out  hypotheses  that  are  not  needed  to 
complete  the  proof.  There  are  a  variety  of  template  matching  parameters  to  help  describe  the  thinning  rule 
to  be  applied.  We  will  use  one  of  them,  the  command:  (binding  *)  as  showtL 


--I  fffoof: 

BY  synthesis  of  FORALL/IMFLIES 
BY  synthesis  of  AND 
BY  analysis  of  FORALL. 

WTITH  ga  FOR  ga;  State 
BY  thinning  (binding  *) 

1.  x’last_event  @  a  >  0 

2.  y’last_event  @  a  >  0 
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3.  y’last_event  @  a  >  0  and  x’last_eveat  @  a  >  0 
•>  z  @  a  =  (x  @  a  and  y  @  a) 

and  z’last_event  @  a  >=  min(y’last_event  @  a,  x’last_event  @  a) 
»  (z  @  a  =  ’  1’)  =  ((x  @  a  and  y  @  a)  =  ’  1’) 

<proof> 

1.  Forall  a: 

State:: 

y’last_event  @  a  >  0  and  x’last_event  @  a  >  0 
->  z  @  a  =  (x  @  a  and  y  @  a) 

and  z’last_event  @  a  >=  nun(y’last_event  @  a.  x’last_event  @  a) 

2.  x’last_event  @  a  >  0 

3.  y’last_event  @  a  >  0 

»  z’last_event  @  a  >=  niin(x’last_event  @  a.  y’last_event  @  a) 
<proof> 

--I  end  proof  section 
--1  end  Larch 


(5)  The  last  step  of  this  subproof  Is  to  use  the  command:  simplify  with  SDVS  simplification  optiai  to  com¬ 
plete  the  proof  as  shown. 


--I  proof: 

BY  synthesis  of  FORALL/IMPLIES 
BY  synthesis  of  AND 
BY  analysis  of  FORALL, 

WITH  a  FOR  a:State 
BY  thinning  (binding  *) 

BY  simplification 
BY  synthesis  of  TRUE 

1.  Forall  a: 

State:: 

y’last_event  @  a  >  0  and  x’last_event  @  a  >  0 
->  z  @  a  =  (x  @  a  and  y  @  a) 

and  z’last_event  @  a  >=  min(y’last_event  @  a.  x’last_event  @  a) 

2.  x’last_event  @  a  >  0 

3.  y’last_event  @  a  >  0 

»  z’last_event  @  a  >=  min(x’last_event  @  a,  y’last_event  @  a) 
<proof> 

"1  end  proof  section 
-I  end  Larch 


(6)  We  repeat  the  same  proof  steps  of  the  first  subproof  to  prove  the  next  subproof  as  shown. 


"I  proof: 

BY  synthesis  of  FORALL/IMFLIES 
BY  synthesis  of  AND 
—I  Subproof  #  1 

BY  analysis  of  FORALL. 

WITH  a  FOR  a:  State 
BY  thinning  (binding  *) 

BY  simplification 
BY  synthesis  of  TRUE 
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-I  Subproof  #  2 

BY  analysis  of  FORALL, 
WITH  a  FOR  a:State 
BY  thinning  (binding  *) 
BY  simplification+ 

BY  synthesis  of  TRUE 
"I  end  proof  secticQ 
"I  end  Larch 


5.2  More  Complex  Logic  Circuit:  Full  Adder 

A  full  adder  represents  the  basic  building  logic  structure  to  build  mwe  complex  adder  which  adds  two 
operands.  Figure  5  shows  the  logic  structure  of  the  full  adder. 


Figure  5:  Full  Adder 

The  proof  structure  for  a  full  adder  will  be  axisidered  from  two  different  ^proaches. 
(1)  The  first  approach  is  the  architectural  Oogic  gates)  approach  as  shown. 


entity  full_adder  is 

port(x,  y,  ci  :bit;  s,  co  :  out  bit); 
end  full^adder, 

architecture  arch  of  full.adder  is 
compment  and2 

port(x,y  :  Bit;  z  :  out  Bit); 
end  comp<ment; 
compcoent  ex_or 

port(x,y  ;  Bit;  z  :  out  Bit); 
end  component; 
component  <x2 

port(x,y  :  Bit;  z  :  out  Bit); 
end  competent; 
signal  zl.  z2,  z3  :bit; 
begin 
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11:  ex_or  port  map(x,y.zl): 
12;  ex_or  port  map{ci^l.s): 
13:  and2  port  map(x,y,z2); 
14;  and2  port  map(zl.ci^3); 
15:  or2  port  map{z2,z3.co); 
end  arch; 


entity  full_adder 
guarantees 
always 

s  =  (  X  XOT  y  xor  ci) 
with  s  delayed  by  0  frcm  x,y,ci 
guarantees 
always 

CO  =  ( ( (x  and  y  )  or  (ci  and  (x  xor  y)))) 
with  CO  delayed  by  0  from  x.y^i 


The  VCs  for  this  logic  structure  is  generated  as  shown. 


--I  Larch 

proof  section  for  arch_full_adder_VC 
-I  proof  secdcMi 

— >  trait  arch_full_adder_VC  has  unfinished  proofs 
-I  VC  in  trait  arch_full_adder_VC 
-I  proof: 

»  ((((Forall  a: 

State:: 

min(x’last_event  @  a,  y’last_event  @  a)  >  0 
->  zl  @  a  =  (x  @  a  XOT  y  @  a) 
and  zriast_event  @  a 

>=  min(x’last_event  @  a,  y’last_event  @  a)  -  0) 
and  (Forall  a; 

State;; 

inin(ci’last_event  @  a,  zl’last.event  @  a)  >  0 
->  s  @  a  =  (ci  @  a  xor  zl  @  a) 
and  s’last_event  @  a 

>=  min(ci’last_event  @  a,  zl’lasLevent  @  a)  -  0)) 
and  (pOTall  a: 

State:: 

niin(x’last_event  @  a.  y’last_evenl  @  a)  >  0 
->  (z2  @  a  =  T’)  =  ((x  @  a  and  y  @  a)  =  ’1’) 
and  z2’last_event  @  ga 

>=  min(x’last_event  @  a,  y’last_event  @  a)  -  0)) 
and  (Forall  a: 

State:: 

inin(zriast_event  @  a,  ci’last_event  @  a)  >  0 
->  (z3  @  a  =  T’)  =  ((zl  @  a  and  ci  @  a)  =  T’) 
and  z3’last_event  @  a 

>=  min(zriast_event  @  a,  ci’last_event  @  a)  -  0)) 
and  (Forall  a: 

State;; 

min(z2’last_event  @  a,  z3’last_event  @  a)  >  0 
->  (co  @  a  =  ’!’)  =  ((z2  @  a  or  z3  @  a)  =  T’) 
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and  co’last_event  @  a 

>=  niin(z2’last_eveiit  @  a,  z3’last_event@  a)  •  0) 

->  (Forall  a: 

State:: 

nim(niin(x’last_eveiit  @  a,  y’last_event  @  a),  ci’last_event  @  a) 
>0 

->  s  @  a  =  ((x  @  a  xor  y  @  a)  xor  ci  @  a) 
and  s’last_event  @  a 

>=  min(min(x’last_event  @  a,  y’last_event  @  a), 
ci’last_event  @  a)  -  0) 
and  (Fwall  a: 

State:: 

min(niin(x’last  event  @  a,  y’Iast_event  @  a),  ci’last  event  @  a) 
>0 

->  CO  @  a 

=  ((x  @  a  and  y  @  a)  or  (ci  @  a  and  (x  @  a  xor  y  @  a))) 
and  co’last_event  @  a 

>=  iiiin(min(x’last_event  @  a,  y’last_event  @  a), 
ci’last_event  @  a)  -  0) 

<proofc» 

"I  end  proof  sectitm 
--1  end  Larch 


The  ccanplexity  of  the  Vdls  is  due  to  the  amplexity  of  the  logical  structure.  It  reflects  all  the  logical  com¬ 
ponents  used  to  structure  the  full  adder.  The  proof  of  this  complex  structure  is  strrught  fcxward  and  follows  the 
same  proof  trend  of  the  two-input  AND  lo^c  gate  shown  previously. 


-I  proof: 

BY  synthesis/analysis  of  FORALI7IMPLIES 
BY  synthesis  of  AND 
-A  The  ^t  subproof 

BY  synth^s/analysis  of  FORALL/IMPLIES 
BY  thinning  (binthng  •) 

BY  simplification-^ 

BY  synthesis  of  TRUE 
-I  The  seoHid  subproof 

BY  synthesi^analysis  of  FORALL/IMPLIES 
BY  thinning  (binding  ♦) 

BY  simplification-i-,  rewriting-H,  simplification 
BY  synthesis  of  TRUE 
“I  end  proof  sectitxi 
"1  end  Larch 


The  proof  structure  of  the  full  adder  should  be  split  into  two  parts  using  and  synthesis  as  shown  above.  By 
using  this  command,  the  proof  complexity  is  reduced  to  two  subproofs  due  to  the  existetK:e  of  the  two  Boolean 
functions:  sum  and  carry  generated  in  the  architecture  and  in  the  specificaticMi.  After  that  split,  the  proof  stnx:- 
ture  follows  the  same  trend  used  befcxe  to  prove  the  subproof. 
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(2)  The  second  approach  is  the  behavioral  approach  as  shown. 


entity  fadder  is 

port  (a,  b,  ci :  in  Bit;  s,  co  :  out  Bit); 
end  fadder. 


architecture  behavi<x  of  fadder  is 
begin 

s  <=  (a  xor  b)  xor  ci; 

CO  <=  (a  and  b)  or  ((a  or  b)  and  ci); 
end  behaviOT; 


entity  fadder 
includes  (Bit2Int) 
guarantees 
always 

int(a)  +  int(b)  +  int(ci)  =  2*int(co)  +  mt(s) 
with  s,  CO  delayed  by  0  from  a,  b.  ci 
end 


In  this  example,  there  is  no  logical  gate  in  the  architectural  structure  of  the  full  adder.  It  is  only  the  Boolean 
functions  which  generate  the  sum  and  the  carry.  The  specification  part  indicates  the  general  requirement  of  the 
full  adder  which  is  independent  from  the  logical  structure.  That  means,  it  is  independent  from  the  Boolean 
equations  used  to  generate  the  sum  and  carry.  It  is  clear  that  the  specification  of  the  behavior  of  the  full  adder 
needs  the  trait:  Bit2Int  to  help  in  the  proof  structure  and  to  make  it  more  manageable  to  complete  the  proof. 
The  VCs  generated  by  LarchA^HDL  theorem  prover  is  shown  below. 


—I  proof  secticm 

— >  trail  behavior_fadder_VC  has  unfinished  proofs 
—I  vc  in  trait  behavior_fadder_VC 
-I  proof: 

»  (Forall  a: 

State:: 

ci’lasLevent  @  a  >  0 

and  (b'last_event  @  a  >  0  and  a’last_event  @  a  >  0) 

->  s  @  a  =  ((a  @  a  xor  b  @  a)  xor  ci  @  a) 
and  s’last_event  @  a 
>=  min(ci’last_event  @  a, 
min(b’last_event  @  a,  a’last^event  @  a))) 
and  (Forall  a: 

State:: 

ci'last_event  @  a  >  0 

and  (b’lasLevent  @  a  >  0  and  a’last_event  @  a  >  0) 

->  CO  @  a 

=  ((a  @  a  and  b  @  a)  or  ((a  @  a  or  b  @  a)  and  ci  @  a)) 
and  co’last^event  @  a 
>=  niin(ci’last_event  @  a. 
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niinG>’last_event  @  a.  a’last_event  @  a))) 

->  (Forall  a; 

State:: 

(a’last_event  @  a  >  0  and  b’last_event  @  a  >  0) 
and  ci’last_event  @  a  >  0 
->  (int(a  @  a)  +  mt(b  @  a))  +  int(ci  @  a) 

=  2  ♦  int(co  @  a)  +  int(s  @  a) 
and  (s’last_event  @  a 

>=  niin(min(a’last_event  @  a,  b’lastjevent  @  a), 
ci’lastjevent  @  a) 
and  co’lastjevent  @  a 

>=  min(niin(a’last_event  @  a.  b’last_event  @  a), 
ci’lastjevent  @  a))) 

<pr(X)£> 

"I  end  proof  section 
"I  end  Larch 


It  is  very  clear  that  the  VCs  reflect  the  two  majcff  parts  of  the  proof:  the  architecture  behavioral  part  and  the 
specification  part.  The  proof  structure  is  shown  below. 


-I  proof: 

BY  synthesis/analysis  of  PORALIVIMPLIES 
BY  thinning  (binding  ♦) 

BY  using  fulladder  in  trait  BitAdder 
rewriting  left  to  right 
BY  simplificatioo+ 

BY  syndesis  of  TRUE 
-1  end  proof  secticxi 
"I  end  L^h 


From  the  previous  proof,  it  is  clear  that  the  proof  structure  follows  the  same  trend  used  in  the  previous  examples 
Gogic  gates).  The  cmly  difference  is  the  use  of  the  theory  of  fulladder  in  the  trait  BitAdder  defined  in  the 
specification  which  makes  the  proof  easier  to  handle  by  providing  the  mathematical  operatiaas  for  ccaiverting  Bit 
to  Ingeger  and  vice  versa. 


Figure  6:  Four-Bit  Ripple  Adder 
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4J  Complex  Logic  Structure:  Four-Bit  Ripple  Adder 

Figure  6  shows  the  logical  structure  of  the  four-bit  ripple  adder. 

In  this  example,  the  logical  complexity  of  the  four-bit  adder  does  not  affect  the  proof  structure  as  shown. 


entity  fourbit_adder  is 

port(xl,  yl.  x2.  y2,  x3,  y3,  x4,  y4.  ci  :bit;  si.  s2,  s3,  s4.  cl.  c2.  c3.  co  :  out  bit); 
end  fourbit_adder. 


architecture  arch  of  fourbit_adder  is 
competent  full_adder 

port(x,y,ci :  Bit;  s,co  :  out  Bit); 
end  component; 
signal  cl.c2.c3  :bit; 
begin 

11:  full.adder  port  map(xl,yl.ci.sl.cl); 

12:  full^adder  port  map(x2.y2.cl.s2.c2); 

13:  full^adder  port  map(x3.y3,c2.s3.c3); 

14:  full_adder  port  map(x4.y4,c3.s4,co); 
end  arch; 


entity  fourbit_adder 
guarantees 
always 

si  =  (  xl  xor  yl  xor  ci) 
with  si  delayed  by  0  from  xl.yl.ci 
guarantees 
always 

cl  =  ( ( (xl  and  yl  )  or  (ci  and  (xl  xot  yl)))) 
with  cl  delayed  by  0  from  xl.yl.ci 
guarantees 
always 

s2  =  (  x2  xor  y2  xor  cl) 
with  s2  delay^  by  0  fr<Mn  x2.y2,cl 
giiarantees 
always 

c2  =  ( ( (x2  and  y2)  <x  (cl  and  (x2  xor  y2)))) 
with  c2  delayed  by  0  from  x2.y2.cl 
guarantees 
alw'ays 

s3  =  (  x3  xor  y3  xor  c2) 
with  s3  delay^  by  0  frem  x3.y3,c2 
guarantees 
always 

c3  ( ( (x3  and  y3  )  OT  (c2  and  (x3  xot  y3)))) 
with  c3  delayed  by  0  from  x3.y3.c2 
guarantees 
always 

s4  =  (  x4  xor  y4  xor  c3) 
with  s4  delay^  by  0  from  x4,y4.c3 
guarantees 
always 

co  =  ( ( (x4  and  y4  )  ot  (c3  and  (x4  xor  y4)))) 
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with  CO  delayed  by  0  from  x4.y4.c3 


end 


—I  proof: 

BY  synthesis  of  FORALL/IMPLIES 
BY  synthesis  of  AND 
BY  hypothesis 
BY  hypothesis 
BY  hypothesis 
BY  hypothesis 
BY  hypothesis 
BY  hypothesis 
BY  hypothesis 
BY  hypothesis 
—I  end  pro^  secticm 
-I  end  Latch 


It  is  clear  that  the  complexity  of  the  logical  strucmre  shown  previously  does  not  prevent  the  proof  to  be  very 
simple  and  straight  ftxward.  However,  the  complexity  of  the  proof  will  be  increased  if  the  specification  and  the 
architecture  bit  vector  operation  and  iteration  (for  statement)  in  their  construcdcxi  as  shown  in  Secticm  5. 


4.4  Logic  Structure  with  Time  Delay 

The  following  example  of  two-input  AND  gate  shows  the  effect  of  signal  delayed  by  an  amount  del  on 
the  proof  structure.  The  amount  of  delay  will  be  defined  as  a  positive  integer  value  as  shown  below. 


entity  and2_del  is 

generic  (del :  positive); 
port  (  X,  y  ;  Bit;  z  :  out  Bit); 
end  and2_del; 

architecture  arch  of  and2_del  is 
begin 

z  <=  X  and  y  after  del; 
end  arch; 


entity  and2_del 

guarantees 

always 

z  =  (x  and  y) 

with  z  delay^  by  del  from  x.  y 
end 


— >  trait  arch_and2_del_VC  has  unfinished  proofs 
-1  Larch 

proof  section  fw  arch_and2_del_VC 
"1  proof  sectiOT 

— >  trait  aich_and2_del_VC  has  unfinished  proofs 
—I  VC  in  trait  arch_and2_del_VC 
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-I  proof: 

»  (Forall  a: 

State:: 

inin(y*last_eveiit  @  a,  x*last_event  @  a)  >  del 
->  z  @  a  =  (x  @  a  and  y  @  a) 
and  z’lasLevent  @  a 

>=  min(y’lasLevent  @  a,  x’last_event  @  a)  -  del) 
->  (Forall  a: 

State:: 

min(x’last_event  @  a,  y’last_event  @  a)  >  del 
->  z  @  a  =  (x  @  a  and  y  @  a) 
and  z*last_event  @  a 

>=  niin(x’lasLevent  @  a,  y’last_event  @  a)  -  del) 
<proo£> 

-I  end  proof  secticMi 
-I  end  Larch 


-I  proof: 

BY  synthesis/analysis  of  PORALL/IMPLIES 
BY  thinning  (binding  ♦) 

BY  simpiificatioa+ 

BY  synthesis  of  TRUE 
-I  end  proof  secticm 
“I  end  Larch 


Ccmparing  the  two  previous  proofs  (and2  and  and2_del),  it  is  clear  that  there  is  no  difference  in  the  proof  struc¬ 
ture  of  the  two-input  AND  gate  withAvithout  delay.  The  difference  appears  when  the  delay  is  used  with  a  more 
complex  logic  circuit  as  shown  in  the  following  example  of  the  sum  output  of  a  full  adder  with  delay  as  shown 
below. 


entity  sum^del  is 

generic  (  del :  positive  ); 
p<xt(x,  y,  ci  :bit;  s  :  out  bit); 
end  sum_del ; 


architecture  arch  of  sum^del  is 
component  xcr_del 

generic  (  del :  positive  ); 
port(x,y  :  Bit;  z  :  out  Bit); 
end  component; 
signal  zl  :bit; 
begin 

11:  xor^del  generic  map  (del)  port  map(x,y,zl); 

12:  xor_del  generic  map  (del)  pcwt  map(ci^l^); 
end  arch; 


entity  sum_del 

guarantees 

always 

s  =  (  X  xor  y  xor  ci) 

with  s  delayed  by  2*del  from  x,yjci 
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end 


— >  trait  axch_sum_del_VC  has  unfinished  proofs 
-I  VC  in  trait  arch_suin_del_VC 
--I  proof; 

» (Forall  a: 

State:; 

mm(x’last_event  @  a,  y’last_event  @  a)  >  del 
->  zl  @  a  =  (x  @  a  xor  y  @  a) 
and  zriast_event  @  a 

>=  niin(x’last_event  @  a,  y’last_event  @  a)  -  del) 
and  (Forall  a: 

State:: 

niin(ci’last_event  @  a,  zriast_event  @  a)  >  del 
->  s  @  a  *  (ci  @  a  xor  zl  @  a) 
and  s’last_event  @  a 

>=  niin(ci’last_event  @  a,  zriast_event  @  a)  -  del) 

->  (ForaU  a; 

State;: 

niin(min(x’last_event  @  a,  y’last_event  @  a),  ci’last_event  @  a) 
>  2  *  del 

->  s  @  a  =  ((x  @  a  XOT  y  @  a)  xor  ci  @  a) 
and  s’last_event  @  a 

>=  niin(min(x’last_event  @  a.  y’last_event  @  a), 
ci’last_event  @  a)  -  2  ♦  del) 

<proofi> 

--I  end  proof  section 
"I  end  Larch 


-I  proof; 

BY  synthesis/analysis  of  FORALLAMPLIES 
BY  thinning  (binding  *) 

BY  using  del_c<xistraint  as  new  hypothesis 
BY  simplificatioo+ 

BY  synthesis  of  TRUE 
"I  end  proof  secticm 
-I  end  Larch 


It  is  clear  from  this  example  that  the  amount  of  delay  del  is  defined  as  two  xor_del  units  delay.  The  proof 
structure  is  similar  to  the  previous  one  for  the  two-input  AND  gate  with  delay.  The  cmly  difference  is  the  com¬ 
mand;  BY  using  del_constraint  as  new  hypothesis  which  introduces  the  delay  as  a  new  hypothesis  to  be  used 
by  the  simplifier  after  this  command.  Without  this  new  hypothesis,  which  has  been  built  into  the  library,  dte 
simplifier  will  not  be  able  to  simplify  the  {ffoof  to  TRUE. 


5.  The  Algorithmic  Nature  of  the  Proof 

From  the  above  examples,  any  logic  circuit  structured  using  logical  gates  has  similar  proof  steps.  It  is  very 
intersting  to  notice  the  fdlowing  rules  when  amstructing  the  proof  for  a  logic  circuit. 
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(1)  The  proof  for  any  logical  entity  should  be  started  by  the  ccmnnand:  By  synthesis  of  FORALL/IMPLIES. 

(2)  The  above  step  can  be  simplified  more  by  adding  the  command:  with  analysis  to  the  previous  synthesis 
command. 

(3)  The  proof  may  be  simplified  directly  using  the  command:  SDVS  simplification  with  add  +,  which  could 
end  the  proof  correctly  with:  By  synthesis  of  TRUE.  However,  if  the  logical  entity  is  very  complex,  then 
another  step  must  be  taken  using  the  command:  By  synthesis  of  AND,  which  will  ^lit  the  proof  into 
several  subproofs  as  shown  in  the  above  examples.  In  this  case,  each  subproof  must  be  proved  separately 
using  the  same  {ffoof  methodology  explained  above. 

(4)  If  the  logic  circuit  has  a  delay  which  is  defined  as  del  ,  then  the  command  BY  using  del_constraint  as 
new  hypothesis,  which  must  be  used  before  the  simplification  process.  Also,  it  is  very  important  to  count 
correctly  the  number  of  units  delay  which  must  be  inserted  in  the  spec'ificaticm  as  showiL  If  the  number  of 
delay  coimts  was  less  than  the  exact  number  of  units  delay,  the  prover  will  not  be  able  to  prove  the 
correctness  of  the  specificatiai. 

It  is  importance  to  notice  that  in  some  cases  the  command:  analysis  of  FORALL  may  be  (Knitted  fiom  the 
proof  steps  shown  previously. 

6.  Special  Cases:  Complex  Proof  Structure 

The  prcxrf  start  to  deviate  from  the  previous  algcffithmic  methcxl  when  the  entity  and/or  its  architecture 
contains  vectors  and/or  iteraticn  using  for  statement  as  shown  in  the  following  examples. 

6.1  Example  of  A  Vector  Signal  Assignment 

This  second  example  is  a  simple  mtxlified  version  of  the  a  simple  Bit-assignment;  it  is  a  Bit-vectcM-  assign¬ 
ment.  The  proof  ccmstructed  by  ORA  Larch/VHDL  became  complex  and  not  easy  to  be  followed  in  a  sys¬ 
tematic  way  as  shown  below. 

entity  vectcrl  is 
generic  (n  :  positive); 

pcHt  (  x  ;  Bit_vector  ((n-1)  downto  0) ;  z  :  out  Bit_vector  ((n-1)  downto  0)); 
end  vector  1  ; 
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architecture  arch  of  vectorl  is 
begin 

z  (n-l  downto  0)  <=  x  (n-1  downto  0); 

end  arch; 


entity  vectorl 

includes(StableArray) 

guarantees 

always 

z  =  X 

with  z  delayed  by  0  £r<Hn  x 
end 


Verifkation  Conditions 

» (Forall  a; 

State:: 

slice(x.  n-l.  0)’last_event  @  a  >  0 
->  slice(z  @  a.  n  -  1. 0)  =  slice(x  @  a.  n  -  1. 0) 
and  slice(z.  n-l.  0)’last_event  @  a 
>=  slice(x.  n-l.  0)’last_event  @  a  -  0) 

->  (Forall  a: 

State:: 

x’Iast_event  @  a  >  0 

->z@a  =  x@aand  z’last_event  @  a  >=  x’last_event  @  a  -  0) 


-I  Proof: 

BY  synthesis/anaiysis  of  FORALJVIMPLIES 
BY  analysis  of  IMPLIES. 

BY  a^ysis  of  FORALL. 

WTITH  a  FOR  a  :  State 
BY  analysis  of  IMPLIES. 

BY  using  last_event_slice  in  trait  BicStableArray  with  (x  for  x. 
a  fcx  a.  n- 1  f<v  i.  0  for  j)  as  new  hypothesis 
BY  simplification 
BY  synthesis  of  TRUE 
AND  THEN 
BY  hypothesis 
AND  THEN 
BY  synthesis  of  AND 
BY  thinning  (not  3) 

BY  using  eq  in  trait  Vector  with  (z  @  a  for  a. 

X  @  a  for  b)  as  new  hypothesis 
BY  rewriting 

BY  claiming  Forall  i :  Int:: 
in_dooiain(i.  z  @  init)->(z  @  a)[i]  =  (x  @  a)[i] 

BY  synthesis/analysis  of  FORALIVIMPLIES 
BY  thinning  2 

BY  using  in_domain  as  rewrite  rule 
BY  using  valid_slice  as  rewrite  rule 
BY  using  low  in  trait  Vector  as  rewrite  rule 
BY  using  high  in  trait  Vector  as  rewrite  rule 
BY  using  slire_val  in  trait  Vector  with  (z  @  a  for  a. 

n-l  fOT  i.  0  for  j.  i  for  n)  as  new  hypothesis 
BY  using  slice_val  in  trait  Vector  with  (x  @  a  for  a. 


2-24 


n-1  for  i,  0  for  j,  i  for  n)  as  new  hypothesis 
BY  rewriting  (6).  simplification 
BY  synthesis  of  TRUE 
THEN 

BY  simplification 
BY  synthesis  of  TRUE 

BY  thinning  (not  4) 

BY  claiming  slice(z.  n-1.  0)’last_event  @  a  =  z’last_event  @  a 
BY  using  last_event_slice  in  trait  Bit_StableArray  with  (z  for  x, 
a  for  a.  n-1  for  i.  0  for  j)  as  new  hypothesis 
BY  claiming  slice(z.  n-l,  0)’last_event  @  a<=z’last_event  @  a 
BY  using  last_event_compooents 

in  trait  Bit_StableArray  with  (z  for  x.  a  for  a. 
slice(z.  n-1.  0)’last_event  @  a  for  n)  as  new  hypothesis 
BY  claiming  slice(z.  n-1. 0)’last_event  @  a<=time(a) 

BY  rewriting 
BY  synthesis  of  TRUE 
THEN 

BY  claiming  Forall  i :  Inf.: 

in_dOTiain(i.  z  @  a) 

->  z[i]’last_cvent  @  a>=slice(z.  n-1.  0)’last_event  @  a 
BY  synthesis/analysis  of  PORALL/IMPLIES 
BY  thinning  menticming  time 
BY  using  last_event_slice_component 

in  trait  Bit_StableArray  with  (z  for  x.  a  for  a.  n-1  for  i.  0  for  j. 
i  for  k)  as  new  hypothesis 
BY  analysis  of  IMPLIES. 

BY  using  in_domain  as  rewrite  rule 
BY  using  valid_slice  as  rewrite  rule 
BY  using  low  in  trait  Vector  as  rewrite  rule 
BY  using  high  in  trait  Vector  as  rewrite  rule 
BY  rewriting  (6).  simplification 
BY  synthesis  of  TRUE 
AND  THEN 

BY  using  last_event_component 

in  trait  Bit_SlableA^ay  with  (slice(z.  n-1.  0)  for  x.  a  fcff  a, 
i  for  i)  as  new  hypothesis 
BY  simplification 
BY  synthesis  of  TRUE 
THEN 

BY  simplification 
BY  synthesis  of  TRUE 

THEN 

BY  simplificaticHi 
BY  synthesis  of  TRUE 
THEN 

BY  claiming  slice(x.  n-1. 0)’last_event  @  a  =  x’last_e\'ent  @  a 
BY  using  last_event_slice 

in  trait  Bit_Stable Array  with  (x  for  x.  a  for  a.  n-1  for  i. 

0  for  j)  as  new  hypothesis 
BY  claiming  slice(x.  n-1. 0)’last_event  @  a 
<=  x’last_eveiu  @  a 
BY  using  last_event_coniponents 

in  trait  Bit_StableAiTay  with  (x  ftx  x.  a  for  a. 
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slice(x.  n-l,  0)’last_event  @  a  fw  n)  as  new  hypothesis 
BY  claiming  slice(x,  n-l,  0)’last_event  @  a<=time(a) 

BY  rewriting 
BY  synthesis  of  TRUE 
THEN 

BY  claiming  Fwall  i :  Int; 
in_d(xnain(i.  x  @  a) 

->  x[i]’last_event  @  a>=slice(x,  n-l,  0)’last_event  @  a 
BY  synthesis/analysis  of  FORALL/IMPLIES 
BY  thinning  menticming  time 
BY  using  last_event_slice_comp(xient 

in  trait  Bit_StableArray  with  (x  for  x,  a  for  a,  n-l  fw  i,  0  for  j, 
i  for  k)  as  new  hypothesis 
BY  analysis  of  IMPLIK, 

BY  using  in.domain  as  rewrite  rule 
BY  using  valid_slice  as  rewrite  rule 
BY  using  low  in  trait  Vector  as  rewrite  rule 
BY  using  high  in  trait  Vectw  as  rewrite  rule 
BY  rewriting  (6),  simplificatitm 
BY  synthesis  of  TRUE 
AND  THEN 

BY  using  last_event_component 

in  trait  Bit_StableArray  with  (slicefx,  n-l,  0)  for  x,  a  for  a, 
i  for  i)  as  new  hypothesis 
BY  simplificaticm 
BY  synthesis  of  TRUE 
THEN 

BY  simplihcatim 
BY  synthesis  of  TRUE 

THEN 

BY  simplificatian 
BY  synthesis  of  TRUE 
THEN 

BY  simplificatiOT 
BY  synthesis  of  TRUE 


The  proof  presented  previously  is  very  ctxnplex  compared  with  the  proof  of  the  previous  examples.  The  ques¬ 
tion  is:  why?  It  is  very  difficult  to  answer  this  questicm  and  find  algorithmic  way  to  proof  similar  entity.  The 
(mly  method  to  be  used  in  such  a  case  is  to  formulate  a  diemy  for  the  above  proof  so  that  this  theory  becomes 
general  enough  to  be  applied  to  similar  cases  when  a  Bit-vector  is  used.  The  following  example  is  more  com¬ 
plex  and  show  the  iterative  form  of  a  four-bit  Adder  using  fw  loop  to  repeat  the  structure  of  the  four  full  adders. 


6.2  Example  of  Iterative  Four-Bit  Ripple  Adder  Using  For  Loop 


entity  add_4  is 

port  (  X,  y  :  Bit_vector  (3  downto  0);  ci ;  Bit_vector  (4  downto  0); 
s,  CO  :  out  Bit_vectOT  (3  downto  0)); 

end  add_4; 
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architecture  arch  of  add_4  is 
ccmponent  full_adder 

pOTt(x,y,ci ;  Bit;  s,co  ;  out  Bit); 
end  competent; 
begin 

d0to3  ;  fw  i  in  0  to  3  generate 

dO  :  full_adder  port  map(x(i).y(i).ci(i).s(i).co(i)); 
ci(i+l)  <=  co(i); 
end  generate; 
end  arch; 


entity  add_4 
includes  ( BitVector) 
guarantees  always 

forall  i :  Int ::  0  <=  i  and  i  <=  3  -> 

s[i]  =  (x[i]  xor  y[i]  xor  ci[i]) 
with  s  delayed  by  0  from  x.  y.  ci 
guarantees  always 

fixall  i ;  Int ::  0  <=  i  and  i  <=  3  -> 

co(i]  =  ( ( (x[i]  and  y[i])  or  (ci[i]  and  (x[i]  xor  y[i])))) 
with  CO  delayed  by  0  from  x,  y.  ci 
guarantees  always 

forall  i ;  Int 0  <=  i  and  i  <=  3  -> 
ci[i+l]  =  co[i] 

with  ci  delayed  by  0  frexn  co 
end 


The  proof  of  the  above  iterative  four-bit  adder  structure  is  more  complex  than  the  proof  of  the  simple  vector 

assignment  statement  shown  in  the  previous  Example.  It  is  also  possible  to  make  the  specification  of  four-bit 

adder  more  general  as  shown  below. 

entity  add4 

includes(Int4) 

guarantees 

fllwsys 

int4bit(s[3].s[2],s[l].s[0])  +  int(co)*2**4  =  int4bit(x[3]jc[2]jc[l]A[0])  -I-  int4bit(y[3].y[2].y[l].y[0]) 
with  s[3]^[2]\s[l]\s[0].  co  delayed  by  0  frcMn  x[0]jc[l]^[2]jt[3].y[0].yll].y[2],y[3] 
end 


The  specification  of  a  four-bit  adder  indepeixlent  of  the  architecture  Oc^cal  circuit  design)  which  make  the 
proof  more  complex  than  before.  Those  special  and  more  general  cases  needs  more  research  work  to  be  (kme  to 
reduce  their  complexity  [13-15]. 


7.  Formal  Methods  in  Education 

The  formal  verification  techniques  are  composed  at  a  very  high  level  of  abstraction  and  they  have  never 
been  used  on  a  wide  scale  to  verify  hardware  and  software  design  projects.  Very  intensive  research  projects  on 
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formal  methods  have  been  spOTSored  and  funded  by  the  DARPA  Infonnaticm  Technology  Office  (TTO)*  the  Air 
Force  Research  Facility  at  Rome  LaboraUMy,  and  NASA,  Also.  Rome  Laboratory  and  DARPA’s  ITO  have  esta¬ 
blished  the  21st  Century  Engineering  Consortium.  The  reason  for  establishing  the  consortium  is  to  promote 
leaching  of  formal  methods  at  all  academic  levels  (undergraduate  and  graduate)  so  that  the  engineering  capabil¬ 
ity  necessary  to  build  highly  assured  systems  can  be  develq^ed  This  means  that  DARPA  and  the  Air  FcHce 
would  like  to  see  a  much  larger  number  of  students  trained  nadcmwide  in  engineering  and  computer  science 
using  fcamal  methods  (examples:  hardware  verification  using  LarchAIBDDL,  software  verification  using  ADA,  C. 
C++  codes,  requirements/specification  analysis,  as  well  as  applying  this  technology  to  other  problem  domains). 
The  following  topics  are  the  main  themes  of  the  Consortium: 

(1)  Who  will  be  the  customer  for  our  fOTmal  method  products? 

(2)  How  will  we  educate  students  in  formal  methods? 

(3)  What  are  we  going  to  teach? 

(4)  How  will  we  achieve  our  goals? 

(5)  What  fcnmal  method  materials  must  be  taught  in  the  undergraduate  and  graduate  level  courses? 

(6)  How  can  formal  methods  be  integrated  into  existing  MATH/CS/CE  courses? 

A  model  course  in  Formal  Specification  and  Verification  with  Labtx^atory  at  the  undergraduate  level  using 
theorem  provers  has  been  suggested  as  shown  in  [16,17].  This  course  offers  a  new  q)proach  for  students  and 
teachers  alike  in  which  thecaem  provers,  like  LarchA^HDL,  will  be  used  to  help  students  structure  and  under¬ 
stand  the  proof  methodology.  A  training  session  which  will  focus  on  how  to  use  the  technology  (software  pack¬ 
ages  selected)  and  how  to  formulate  problems  (waiting  correct  specifications)  so  that  the  theorem  prover  can 
proceed  to  proof  its  c<xrectness  was  also  suggested.  This  teaching  methodology  suggested  in  [16,17]  will 
encourage  fk:ulty  members  to  introduce  formal  verification  and  proof  methodology  into  their  courses  and  then  a 
wide  range  of  undergraduate  students  will  be  trained  and  equipped  to  deal  with  more  complex  verificaticm 
provers  in  the  future.  The  21st  Century  Engineering  (Zcaisortium  announced  its  first  wcricshop  which  will  be 
held  in  Melbourne,  Rorida,  March  18-19,  1998.  This  woricshop’s  principal  ccmcem  is  promoting  formal 
methods  in  computer  science  and  engineering  program  and  integrating  it  in  the  main  stream  curricula  of  these 
disciplines. 
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8.  Conclusion 


The  most  important  feature  of  the  Theorem  Prover  developed  by  ORA  is  its  capability  to  guide  the  user  to 
structure  the  proof  to  the  problem.  So  LarclVVHDL  theorem  prover  is  a  very  important  educational  tool  to 
teach  the  mathematical  foundation  to  the  student  and  encourage  them  to  proof  any  problem  if  it  is  possible  to 
write  its  specification  correctly.  That  means,  if  the  {woblem  is  formulated  and  its  specificatirai  is  written  in  the 
ccKiect  way.  The  proof  structure  of  different  classes  of  logical  circuits  was  developed  in  an  algorithmic  way. 
The  complexity  problem  of  the  proof  of  certain  logic  structure  was  addressed  and  solution  was  suggested  to 
improve  the  perfonnance  of  the  theorem  prover.  By  providing  more  proved  theories  into  the  library  body  of  the 
existing  theorem  prover.  the  ccanplexity  of  the  proof  can  be  reduced  to  its  minimum.  It  was  expected  that  for¬ 
mal  method  for  specification  and  verificaticm  of  both  hardware  and  software  projects  will  overcome  its 
difficulties  and  dominate  the  field  of  producing  reliable  and  secure  systems. 
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Abstract 


In  this  document  we  formally  specify  and  verify  a  part  of  the  Multilevel  Information  System 
Security  Initiative  (MISSI).  MISSI  is  a  National  Security  Agency  (NSA)  program,  designed  to  send 
protected  messages  over  unprotected  networks  such  as  Internet.  MISSI  uses  several  kinds  of  cryptog- 
raphv  for  protecting  the  messages.  Cryptography  is  accomplished  using  a  credit  card  sized  Personal 
Computer  Memory  card  Interface  Association  (PCMCIA)  card  called  the  FORTEZZA  Crypto  Card 

(or  "Card”,  for  short).  wtcct  a 

We  constructed  a  formal  specification  of  receiving  and  forwarding  e-mail  using  MISSI.  We  used 
ACP  123,  X.400  and  SDN  701  standards  to  specify  message  structure  and  MISSI  access  and  for¬ 
warding  policy.  We  used  formal  language  called  Promela,  based  on  Hoare’s  CSP.  We  verified  the 
model  using  automated  model  checker  SPIN,  developed  by  AT&T. 
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FORMAL  SPECIFICATION  AND  VERIFICATION  OF 
MISSI  RECEIVER  AND  FORWARDING  USING  SPIN 


Milica  Barjaktarovic 


1  Introduction 

The  Multilevel  Information  System  Security  Initiative  (MISSI)  is  a  National  Security  Agency  (NSA) 
program,  designed  to  send  protected  messages  over  unprotected  networks  such  as  Internet  [2].  New 
releases  of  MISSI  are  posted  in 

http : //www . armadillo . hnntsville . al . us. 

MISSI  employs  cryptography  by  using  a  credit  card  sized  Personal  Computer  Memory  card 
Interface  Association  (PCMCIA)  card  called  the  FORTEZZA  Crypto  Card  (or  “Card,”  for  short). 
MISSI  Forteza  cards  handle  Sensitive  but  Unclassified  (SBU)  messages,  and  FORTEZZA  Plus  Cards 
handle  Secret  (S)  messages.  MISSI  uses  a  combination  of  private  and  public  cryptography.  Users’ 
public  cryptography  information  is  posted  in  the  public  distributed  database  called  the  Directory, 
in  files  called  certificates  [5,  12]. 

MISSI  messages  can  travel  using  X.400  or  MIME  mail.  MISSI  messages  which  travel  using  X.400 
must  be  sent  as  military  messages  (MMs).  MMs  cai^be  individual  or  organizational.  Organizational 
messages  commit  organizations,  and  require  special  security  concerns.  For  example,  if  A  sends  an 
organizational  military  message  to  B,  can  B  forward  it  to  a  non-organizational  user? 

From  the  engineering  perspective,  the  two  basic  questions  are: 

•  how  do  we  design  and  build  MISSI  receiver  and  forwarding  policy? 

•  how  do  we  know  that  it  works  correctly? 

In  this  report,  we  will  discuss  the  above  questions.  This  report  is  a  continuation  of  our  work  presented 
in  [6]  and  [7]. 

The  basis  of  our  work  is  application  of  a  mathematically  based  formal  method  to  specify  and 
verify  local  cashe  management  protocol,  in  order  to  gain  better  understanding  of  protocol  design  by 
formal  reasoning  and  proofs. 

The  goals  of  our  work  are: 

•  help  the  designers  of  complex  systems  such  as  MISSI  comprehend  and 
design  the  system  in  an  organized  way 

•  help  the  communication  between  teams  designing  and  building  various 
parts  of  the  system 
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♦  eventually  hand  the  verified  formal  specification  to  the  programmers  and 
builders;  based  on  their  input,  change  the  specification,  and  repeat  this 
step. 

We  chose  formal  language  Promela,  based  on  Hoare’s  CSP,  to  specify  and  test  MISSI  receiving 
and  forwarding.  Promela  is  a  language  based  on  CSP  and  implemented  in  C.  SPIN  is  the  publicly 
distributed  automated  verification  tool  which  supports  Promela  [8,  10,  9,  15]. 

We  have  cited  various  documents  throughout  this  report,  and  included  our  comments  in  italics. 

2  Overview  of  MISSI 

MISSI  provides  the  following  security  services:  data  integrity  and  authentication  (via  hashing,  digital 
signatures,  and  certificate  authentication);  confidentiality  (via  public  and  secret  cryptography);  non¬ 
repudiation  with  proof  of  origin;  and  non-repudiation  with  proof  of  receipt  (optional).  Users  are 
authorized  by  certification  authorities,  which  issue  certificates  (“tickets”)  to  the  users. 

In  order  to  illustrate  how  MISSI  works,  we  will  assume  that  User  A  wants  to  send  e-mail  to 
User  B.  A  will  login  with  his  Card,  supply  his  PIN  number,  write  a  text  message,  and  his  Card  will 
encrypt  the  message  using  a  secret  key  and  sign  the  message.  The  secret  key  is  encrypted  using 
public  key  cryptography  and  sent  in  the  header  of  the  message.  Receiver  will  extract  the  secret  key 
using  public  decryption,  and  decrypt  the  message.  .Receiver  will  also  verify  sender’s  signature,  and 
signatures  of  authorities  that  signed  sender’s  signature. 

The  Card  is  contacted  via  SDN.701  Message  Security  protocol.  The  process  of  sending  out  a 
message  to  the  Internet  involves  using  X.400  or  RFC1521  message  formatting  mechanisms.  X.400  is 
a  suite  of  standards  for  message  formatting  and  transfer.  Each  workstation  has  application  software 
called  User  Agent  (UA),  which  sends  and  receives  messages. 

Public  keys  are  posted  in  the  MISSI  Directory.  Each  user’s  private  keys  and  PIN  are  stored  on  the 
user’s  Card  and  non-readable  by  the  user.  X.500  is  a  suite  of  standards  for  the  distributed  Directory 
service.  X.500  Directory  is  a  distributed  “white/yellow  pages”  repository  of  public  keys.  The  MISSI 
Directory  is  the  X.500  Directory  equipped  with  a  FORTEZZA  card.  UA  contains  Directory  User 
Agent  (DUA),  which  contacts  the  Directory  through  Directory  Service  Agent  (DSA). 

Public  keys  are  stored  in  certificates.  A  certificate  is  a  data  structure  which  includes  each 
user’s  name  and  public  keys,  signed  by  the  authority  that  issued  the  certificate.  Certificates  and 
keys  can  expire,  get  compromised  in  some  way,  or  be  revoked.  Authorities  keep  invalid  certificates’ 
serial  numbers  in  Certificate  Revocation  Lists  (CRLs).  Authorities  also  keep  Key  Revocation  Lists 
(KRLs).  X.509  standard  specifies  the  certification  hierarchy  and  certificate  and  key  management. 

On  top  of  the  hierarchy  is  the  Policy  Approving  Authority  (PAA),  which  is  the  trusted  authority. 
PAA’s  signature  is  stored  on  each  user’s  Card,  and  used  for  the  final  authentication.  PAA  issues 
certificates  for  Policy  Creation  Authorities  (PCAs),  but  cannot  revoke  any  PC  A  and  does  not  keep 
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Certificate  Revocation  List  (CRL)  for  PCAs.  Each  PCA  issues  certificates  for  Certification  Author¬ 
ities  (CAs)  in  its  domain,  and  maintains  a  CA  CRL.  It  also  maintains  a  CA  KRL,  and  distributes 
it  to  all  CAs  in  its  domain.  PCA  will  post  signed  CRL  and  KRL  to  the  Directory.  Each  CA  issues 
user  certificates,  and  maintains  and  posts  user  CRL  to  the  Directory. 

PAA,  PCA  and  CA  operate  through  a  CA  workstation  (CAW)  which  can  be  set  up  to  pro¬ 
vide  PAA,  PCA  or  CA  capabilities.  CAW  has  a  trusted  operating  system,  and  is  connected  to  a 
FORTEZA  device. 

In  Figure  1  we  show  the  overall  MISSI  architecture  and  identify  MISSI  components.  The  figure 
represents  an  enclave  consisting  of  a  Local  Area  Network  (LAN)  with  MISSI  components,  attached  to 
the  Internet  via  a  Secure  Network  Server  (SNS).  SNS  ensures  that  messages  of  appropriate  security 
levels  leave  and  enter  the  enclave.  We  are  assuming  the  enclave  is  classified  as  Secret.  Sensitive 
but  Unclassified  enclaves  can  have  a  commercial  firewall  instead  of  SNS  [13].  Other  enclaves  are 
attached  to  the  WAN.  but  we  do  not  show  them  to  conserve  the  space. 

Workstations  are  equipped  with  either  FORTEZZA  Cards  (F)  or  FORTEZZA  Plus  cards  (FP), 
or  they  have  no  cards.  Users  are  certified  by  the  Certification  Authority  (CA)  operating  at  the  CA 
workstation  (CAW).  Audit  Manager  (AM)  is  used  for  auditing  the  system.  Mail  List  Agent  (MLA) 
is  used  to  forw’ard  mail  to  a  list  of  recipients.  Rekey  Manager  (RKM)  is  used  to  rekey  the  cards. 


Figure  1:  MISSI  Components 


3  MISSI  Workstation 

The  abstract  view  of  the  interface  between  a  MISSI  workstation  and  FORTEZZA  card  consists  of 
three  interacting  units:  UA,  MSP  functions,  and  the  Card.  UA  application  process  contacts  the 
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hardware  on  the  Card  through  MSP  application  software.  MSP  application  software  represents  a 
high-level  interface  to  the  Card  and  consists  of  a  library  of  eight  msp.  functions.  These  functions  call 
on  the  library  of  fifty  one  Crypto  Interface  (Cl)  Cl-  functions,  which  interface  with  the  card  device 
driver  and  eventually  with  the  hardware.  The  interfaces  are  shown  in  Fig.  reffig:3-rings. 


Figure  2:  Abstract  View  of  MISSI  Workstation  /FORTEZZA  Card  Interface 

In  Figure  3  we  show  the  software  components  within  a  MISSI  workstation  and  its  connection  to 
the  FORTEZZA  hardware. 


Overview  of  MISSI  Architecture 


Figure  3:  Local  Workstation  Components 


Certificate  validation  is  described  in  detail  in  [7]. 


User  workstation 


Figure  4:  MISSI  Send  Mail 

4  Message  Headers 

If  MISSI  uses  X.400  for  message  transport,  all  messages  must  be  formatted  as  Military  Messages 
(MMs),  which  are  lebeled  with  content  type  P772.  j:he  nesting  of  X.400,  MSP  and  P772  messages 
is  shown  in  Fig.  5. 

This  section  outlines  the  sections  of  RFC822/1521  and  MSP  headers  which  are  crucial  for  pro¬ 
cessing  of  received  and/or  forwarded  messages,  in  the  most  general  terms.  We  do  not  include  all 
header  fields,  for  the  sake  of  brevity. 

RFC822,  or  RFC  1521  message  contains  the  following  fields:  RFC822/1521  header,  and  body 
part  (which  is  ANS.l  encoded  MSP  message). 

RFC822/1521  header  information  is  taken  from  the  summary  provided  in  [3],  and  the  official 
MSP  header  specification  is  taken  from  [5]. 

RFC822/  1521  header  contains  the  following  fields: 

From:  sender  DN 

To:  receiver  DN 

Services  applied  (optional) 

Content  type:  multipart/X-MSP 

Preamble:  seems  like  it  does  not  contain  useful  MISSI  info 
Plain  text:  user-selectable  option,  if  message  not  encrypted 
Body  part: 

application  type:  MSP 

security  applied:  signed  /signed,  encrypted 
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X.400  ENVELOPE: _ 

PI  envelope 


MSP  CONTENT  TYPE: 


MSP  heading 


MSP  encapsulated  content  »  MSP  body  parts  1,  ...  N 


MM  heading  *  IPMS  Helds  +  MM  extensions 


body  port  1 

body  part  N 

_ 1 

MBP  body  part  1 
(MM  content  type;  ) 


MBP  body  part  N 
(mM  content  type;  ) 


Figure  5:  Message  Nesting 

ANS.l  encoded  MSP  message  consists  of  MSP  header  and  MSP  encapsulated  content.  It  is 
possible  that  the  header  and  the  encapsulated  content  are  signed  (this  signature  is  called  sequence 
signature,  and  the  message  is  called  a  signed  MSP  message).  msp_status  and  msp-deliver  process 
this  “MSP"  part  of  the  incoming  message. 

MSP  header  contains  the  following  fields:  MessageSecurityProtocol:  Msp  /  SIGNED  Msp 
“Msp"  field  is  a  sequence  of  the  following  fields: 

OriginatorSecurityData  (optional):  contains  information  needed  to  process  PerRecipientToken, 
i.e.  for  decryption.  This  information  will  be  either  pertaining  to  the  original  sender,  or  to  MLA  if 
MLA  processed  the  message. 

originatorCertificate  (optional):  this  field  is  required  according  to  FORTEZZA  require¬ 
ments  [4]. 

organizational  msg  flag  is  set  in  the  certificate 
additionalSecurityInfo  (optional):  contains  Irbac  information 

signatureBlock  (optional):  originator's  signature  information,  in  case  the  message  was  signed 
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recipientSecurityData:  set  of  PerRecipientToken  (optional): 

RecipientKey  Token: 

encapsulatedContentType: 

Security  Label: 

SecurityClassification  (optional):  U/  UBS/  C/  S/  TS 
SecurityCategory  (optional):  contains  prbac 

contentDescription 

mspSequenceSignatureCertificate  (optional):  certification  path  of  the  originator 
encapsulatedContent 

What  is  missing  here  is  the  information  about  the  security  level  of  the  originator's  enclave.  Some 
MISSI  decisions  are  based  on  the  level  of  the  originator's  and  recipient's  enclaves,  yet  this  infor¬ 
mation  is  not  earned  in  MSP  header.  Also,  security  classification  is  an  optional  field  in  SDN.  701, 
yet  some  MISSI  decisions  are  based  on  it.  Therefore,  we  conclude  that  MISSI  messages  must  have 
security  classification  specified. 

[^^]j  specifies  that  LAs  are  grouped  intoxi^sses  based  on  the  type  of  content  of  messages 
they  can  handle.  The  MTS  provides  a  UA  with  the  ability  to  identify  its  class  when  sending  messages 
to  other  UAs/’ 

5  AGP  123:  Military  Messaging 

Military  messaging  is  used  in  Defense  Messaging  System  (DMS)  [16],  [17],  All  Military  Messages 
(MMs)  must  have  security  service  applied  to  them. 

ACP  123  defines  extensions  to  IP  messaging  which  supports  Military  messaging.  “Unless  excep¬ 
tions  are  noted,  all  statements  which  apply  to  Interpersonal  Messaging  Services  (IPMS)  also  apply 
to  Military  Messaging  Services  (MMS)”  [17],  p.A-2.  Military  messaging  is  a  superset  of  interpersonal 
messaging.  “The  structure  of  military  messages  is  fundamentally  the  same  as  that  of  IP  messages. 
Additional  elements  in  the  heading  are  required  to  support  MM,  sa  shown  in  Fig.  5.  IP  messages 
(IPMs)  have  content  type  P22,  and  military  messages  (MMs)  have  content  type  P772”  [17],  p.A-5. 

MM-UA  must  keep  a  record  for  each  message.  For  incoming  messages,  the  following  record  is 
kept: 

•  security  label 

•  extended  authorization  information 
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•  ... 


•  message  originator 

For  outgoing  messages,  the  following  record  is  kept: 

•  security  label 

•  extended  authorization  information 


•  release  authority 

It  is  not  clear  from  ACP  123  standard  and  its  Annex  A  if  MMs  must  be  organizational,  or 
individual  messages  are  allowed  as  well.  Seems  like  ACP  123  allows  only  organization  messages,  and 
Annex  A  allows  individual  messages  as  well: 

“ACP  123  identifies  the  service  and  protocol  requirements  necessary  to  ensure  interoperability 
among  Military  Message  Handling  Systems  (MMHS)  for  military  message  traffic  that  formally  com¬ 
mits  an  organization  and  requires  authorized  release.  This  ACP  specifically  does  not  cover  messaging 
between  individuals”  [16],  p.1-1. 

However,  Annex  A  of  ACP  123,  aka  US  Supplement,  does  allow  individual  messaging:  “The 
Military  Messaging  Service  (MMS)  provides  an  elertronic  mail  facility  between  military  personnel 
and  military  organizational  users.  ... 

The  Military  Messaging  User  Agent  (MM-UA)  is  the  entity  that  represents  a  military  individual 
or  organization  with  the  MMS. 

The  MMS  protocol  supports  the  exchange  of  messages  between  military  individuals  or  organiza¬ 
tions.  The  nomenclature  for  this  protocol  is  P772”  [17],  p.A-9. 

The  question  is:  are  military  personnel  who  are  not  organizational  users  a  special  kind  of  indi¬ 
vidual  users? 

5.1  Individual  and  Organizational  Users 

There  are  two  classes  of  users  in  the  DMS:  individual  and  organizational.  These  two  kinds  of 
users  must  be  distinguished,  as  security  requirements  are  greater  for  organizational  users.  The 
organizational  or  individual  classification  is  assigned  trough  digital  signature  privileges  in  the  Digital 
Signature  Standard  (DSS)  key  material. 

Organizational  users  represent  an  organization  and  can  commit  organizational  resources.  Or¬ 
ganizational  messaging  is  defined  as  message  exchange  between  organizational  elements.  These 
messages  require  approval  for  transmission  by  designated  officials  at  the  sender  side,  and  determina¬ 
tion  of  distribution  by  the  receiver  side.  [5]  requires  all  MSP  UAs  to  “determine  whether  a  message 
is  an  organizational  message  and  unambiguosly  communicate  that  fact  to  the  user.  A  message  is 
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considered  an  organizational  message  iff  it  has  been  signed  by  a  user  with  a  certificate  designated 
as  an  organizational  certificate.”  Therefore,  we  conclude  that  organizational  users  must  not  send 
unprotected  messages;  and  that  organizational  users  must  be  at  least  SBU  MISSI  users. 

All  other  messages  are  classified  as  individual  messages. 

A  user’s  certificate  contains  the  following  fields  of  interest  to  us: 

KEA  Clearance  field,  containing  user  security  clearance  (U.  SBU,  S,  or  TS); 

DSS  Privileges  field,  containing  user’s  privileges  (organizational  authority,  PAA,  PCA,  PA,  or 
NoSignatureCapability/ReadOnly). 

6  Receiver  Requirements 

[4],  p.l2,  specifies: 

‘T2.  Upon  successful  verification  of  an  MSP  protected  message,  the  application  shall: 

a.  Display  to  the  recipient  the  originator’s  authenticated  DN. 

b.  Provide  the  recipient  with  the  option  to  display  the  entire  certificate  hierarchy  in  a  signature 
notification  report.  Which  certification  hierarchy  -  there  can  be  three  of  them  included  within  MSP 
header:  sender's,  SNSs  (for  sequence  signature),  and  MLAs  (for  encryption  token). 

c.  Display  to  the  recipient  the  MSP  security  label  of  an  encrypted  message  after  successful 
message  verification. 

The  application  shall  check  the  MSP  security  label  to  verify  that  both  the  originator  and  recipient 
have  that  clearance  authorized  in  their  certificates  before  displaying  the  message. 

If  the  security  label  check  fails,  the  user  shall  be  notified  that  the  message  clearance  is  unautho¬ 
rized,  message  processing  shall  stop  and  any  message  processing  data  shall  be  deleted  and  overwrit¬ 
ten. 

d.  Identify  the  message  as  organizational  to  the  recipient  if  the  Organizational  Releaser  privilege 
bit  is  set  in  the  originator’s  certificate. 

e.  Display  to  the  recipient  only  the  message  header  and  content  that  was  MSP  protected.  ” 

'T5.  Perform  MSP  sequence  signature  validation  prior  to  any  other  MSP  processing  when  re¬ 
ceiving  a  message  containing  a  sequence  signature. 

The  application  shall  notify  the  user  if  the  originator’s  DN  for  the  sequence  signature  is  not  the 
same  as  the  originator’s  authenticated  DN  for  the  message  signature. 

If  the  authenticated  DNs  are  different  then  the  application  shall  provide  the  recipient  the  option 
to  display  both  DNs  and  certificate  hierarchies.” 

“16.  Include  the  entire  user  to  PCA  certificate  path  in  MSP  header  by  default.”  Since  there  can 
be  up  to  three  certification  paths  in  a  message,  should  we  include  them  all  or  only  the  sender's  path? 

[1]  p.42  states  that:  “To  comply  with  SDN.701,  the  Application  (i.e.  UA)  must  unambiguously 
communicate  the  following  information  to  the  user: 
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1.  The  authenticated  identity  of  the  message  signer,  if  the  message  was  signed 

2.  The  authenticated  identity  of  the  message  encryptor  (i.e.  token  generator) 
this  could  be  either  the  sender  or  MLA  -  or  SNS?,  and  the  security  label 
if  the  message,  if  the  message  was  encrypted 

3.  any  errors  detected  during  MSP  processing.” 

7  Receiver  Algorithm 

MISSI  receiver  is  shown  in  Fig.  6. 


Figure  6:  MISSI  Receive  Mail 

From  [5],  [4]  and  [2],  we  specify  receiver  algorithm  as  follows: 

Receive  new  message 

UA  informs  the  user  about  the  received  message 
user  selects  message  for  processing 

UA  checks  if  the  complete  msg  is  signed; 
if  yes,  validate  sequence  signature 

UA  calls  msp_status  to  examine  the  message 
msp.status  returns  msg  parameters,  including: 
security  services  applied 
certification  paths:  sequence  signer’s 
message  signer’s 
message  encryptor ’s 
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message  classification 

message  content,  if  the  message  was  not  encrypted 

UA  verifies  each  certification  path  returned  by  msp^status 

UA  displays  to  the  user: 
content  description 
services  applied 

authenticated  identity  of  msg  signer 
authenticated  identity  of  msg  encryptor 
seciirity  label  (if  msg  was  encrypted) 

UA  calls: 

msp.deliver  to  process  incoming  msg;  or 
msp^val.receipt ,  to  process  incoming  receipt 

if  msg  was  encrypted,  msp.deliver  will: 
select  token 
perform  prbac 
if  prbac  passes, 

extract  Irbac  information  from  the  token 
perform  Irbac 
if  Irbac  passes, 

decrypt  msg  ^ 

create  signed-only  msg  version  for  forwarding/revalidation 

if  msg  was  signed,  msp^deliver  will: 
perform  signature  validation 
perform  receipt  request  processing 

receipt  may  be  returned  automaticadly  or  after  confirmation 
from  user 

If  msg  was  not  encrypted, 

UA  saves  it  for  forwarding/revalidation 

If  the  encapsulated  content  contains  forwarded  messages, 

UA  must  extract  them  and  manage  them  for  future  forwarding  and  revalidation. 

UA  disposes  message,  or 
retains  it  for  storage  and  revalidation: 
store  the  original,  or 
store  content  only,  or 

store  content  and  msg  signature,  for  forwairding  and  non-repudiation 

If  a  user  requests  that  the  message  encapsulated  by  a  stored  message  be 
displayed  but  not  validated, 

UA  calls  msp^status  to  provide  the  encapsulated  content 
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If  the  user  requests  that  the  message  be  validated, 
UA  calls  msp.deliver  or  msp.val^receipt 


7.1  msp^tatus 

mspjstatus  is  used  to: 

•  decode,  but  not  decrypt  incoming  message 

•  determine  security  services  and  message  attributes:  content  descryption, 
content  type  and  organizational  flag 

•  extract  certification  paths 

The  inputs  to  msp-5tatus  include: 

msp4>ath:  pointer  to  the  ASN.l  encoded  message 

content^ath:  pointer  to  the  file  in  which  to  store  encapsulated  content,  if  the  message  was  not 
encrypted 

The  outputs  of  msp-status  include: 
content^ath 

servjsel:  services  applied  to  the  message 

msg jLtts:  message  attributes:  content  descryption,  content  type  and  organizational  flag 
org_paths:  certification  paths  of  sequence  signer,  message  signer,  and  key  exchange  token  signer 
(i,e.  message  encryptor) 

7.2  msp-deliver 

mspjdeliver  is  used  to: 

•  provide  encapsulated  content  (i.e.  decrypted  message  if  the  message  was 
encrypted),  services  applied,  organizational  flag,  and  other  information 
for  incoming  messages,  after  the  messages  have  gone  through  msp-status. 
mspjdeliver  will  also  verify  sequence  signature  and  message  signature, 
and  partially  re-authenticate  certification  paths. 

•  create  signed-only  version  of  the  incoming  message  (to  be  used  for  for¬ 
warding  and  revalidation),  if  the  message  was  encrypted. 

•  create  signed  receipt  for  the  incoming  message,  if  requested 
The  inputs  to  mspjdeliver  include: 

msp_path:  pointer  to  the  pathname  for  the  incoming  ASN.l  encoded  message 
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cert  J.ndex:  certificate  number  for  the  user-selected  personality 

content-path:  pointer  to  pathname  in  which  msp^deliver  will  store  the  unencrypted  encapsu¬ 
lated  content 

org-paths:  pointer  to  originator  certification  path  structure: 
seq_cert_path:  pointer  to  the  certification  path  of  sequence  signer 
msg-.cert4>ath:  pointer  to  the  message  signer 

key.cert^ath:  pointer  to  the  message  encryptor  certification  path 

sign_cert-path:  pointer  to  the  file  which  will  contain  ASN.l  encoded,  MSP  formatted  signed 
receipt 

fwd4>ath:  pointer  to  the  file  which  will  contain  the  ASN.l  encoded,  MSP  formatted  message 
containing  the  signature  block  and  unencrypted  encapsulated  content  from  the  incoming  message, 
if  it  was  signed  and  encrypted. 

fwd_seq:  flag  indicating  if  the  signed-only  MSP  message  saved  for  forwarding  and  revalidation 
(in  fwd_path)  should  get  sequence  signature  applied  to  it. 

recjseq:  flag  indicating  if  sequence  signature  must  be  applied  to  the  signed  receipt  generated 
for  the  incoming  message. 

The  outputs  from  msp_deliver  include: 

serv...sel:  which  security  services  have  been  applied  to  the  message 
content-path:  unencrypted,  encapsulated  contgjit 

f  wd_path:  signed-only  version  of  the  message,  stored  for  forwarding/revalidation 
msg-atts:  message  attributes:  security  label,  content  description,  organizational  flag,  content 
type,  security  categories,  and  policy  ID. 

8  Forwarding 

[4]  p.i4  specifies; 

‘‘20.  Allow  the  user  to  forward  the  following: 

a.  Unprotected  messages 

b.  MSP  signed  messages  with  message  signature 

c.  The  verified  content  of  MSP  protected  messages.'’ 

Requirement  20. c  is  saying  that  a  user  is  allowed  to  forward  decrypted  contents  of  formerly 
encrypted  message,  without  any  signatures  to  prove  where  the  original  signed  and  encrypted  message 
came  from  (encrypted  messages  are  always  signed).  This  can  be  a  security  hole. 

21,  22.  Allow  the  user  to  receive,  review,  and  reply  to  unprotected  and  MSP  protected  messages. 
23.  Allow  the  user  to  receive  and  review  to  unprotected  and  MSP  protected  messages  that 
contain  forwarded  MSP  signed  messages  with  message  signature.” 
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Requirements  20  and  23  seem  to  contradict  each  other  -  a  user  is  allowed  to  forward  three  kinds 
of  messages,  yet  is  allowed  to  receive  and  review  only  one  kind  of  the  three. 

What  happens  if  user  A  forwards  to  user  B  an  unprotected  message  or  decrypted  contents  of 
formerly  encrypted  message.  According  to  requirement  23,  user  B  is  not  able  to  receive  and  review 
the  message. 

“24.  Be  capable  of  inserting  file  attachments  into  messages  that  will  be  unprotected  and  MSP 
protected(,)  and  extracting  file  attachments  from  unprotected  and  MSP  protected  messages.” 

There  are  no  other  restrictions  on  file  inserts.  Since  all  MISSI  messages  can  be  stored  as  the 
decrypted  content  only,  as  specified  in  clause  20.,  this  means  that  we  can  send  contents  of  any  MISSI 
message  as  a  file  inclusion  in  an  unprotected  message.  This  is  a  security  hole. 

“4.  Allow  the  user  to  change  Card  individual  and  organizational  personalities  during  a  session. 

a.  When  displayed  to  the  user,  the  application  shall  interpret  the  four  byte  certificate  Us¬ 
age/Equipment  specifier  as  follows:  “INKS”  as  “Individual,”  “INKX”  as  “Individual  read  only,” 
“ONKS”  as  “Organizational,”  and  “ORKX”  as  “Organizational  Read  Only.” 

b. 

c.  ...  ” 

“5.  Allow  user  to  logout  and  re- login  to  the  Card  at  any  time  without  having  to  restart  the 
application.” 

Each  user  has  only  one  PIN,  which  is  used  to  allow  access  to  FORTEZZA  Cards. 

Each  user  has  multiple  personalities,  and  each  personality  has  its  own  certificate  and  distinguished 
name  (DN). 

However,  [5]  p.ll  forwarding  specifications  conflict  with  [4]  clause  20:  “An  MSP  message  may 
be  retained  in  at  least  three  forms: 

•  original  form  (exactly  as  it  arrived  prior  to  any  processing) 

•  content  only  (without  any  MSP  heading  information) 

•  content  with  signature  (content  retaining  the  message  signature  informa¬ 
tion  to  provide  non-repudiation  and  to  support  forwarding).” 

[5]  specifies  forwarding  rules  as: 

•  any  number  of  forwarded  MSP  messages  may  be  conveyed  within  a  new 
message 

•  forwarded  MSP  messages  may  be  nested  within  one  another 

•  messages  that  can  be  forwarded  are: 

-  signed  receipts 
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-  the  original  unencrypted  content  and  the  original  MSP  signature 

-  encrypted  MSP  messages  (if  the  recipient  possesses  the  cryptographic 
material  required  to  decrypt  the  forwarded  MSP  message) 


[17]  p.  A-28  specifies  the  forwarding  rules  as: 

•  original  encrypted  content  type  and  envelope  information 

•  the  original  contents  and  delivery  information  (encrypted  messages  will 
be  decrypted  prior  to  forwarding).  Are  the  signatures  preserved?  Or  we 
send  the  original  content  without  any  signature? 

MSP  allows  for  forwarding  MSP  messages  within  a  MSP  message.  ACP  123  allows  for  forwarding 
MM  or  IPM  within  a  MM  message 

9  Secure  Network  Server  (SNS) 

Only  Secret  enclaves  are  required  to  have  SNS. 

[4]  p.7  says:  "SNS  for  Secret  High  Enclave  will: 

•  do  positive  identification  and  authentication'Tbr  messages  in  and  out  of 
the  enclave  in  accordance  with  local  security  policy 

•  check  for  proper  invocation  of  security  services  for  messages  leaving  the 
enclave  in  accordance  to  the  local  policy 

•  regrade  messages  according  to  the  local  policy: 

-  downgrade  messages  destined  for  lower  level  enclaves 

-  upgrade  messages  entering  from  lower  level  enclaves 

How  does  SNS  know  what  enclave  level  a  message  is  coming  from?  Mes- 
sage  headers  carry  information  about  individual  messages,  not  enclaves. 

The  above  requirement  is  not  clear  -  does  “destined  to  lower  level  enclave” 
means  that  a  message  is  leaving  the  present  enclave  and  going  to  a  lower 
level  enclave:  and  “entering  from  lower  level  enclave”  means  that  a  me5- 
sage  originated  at  a  lower  level  enclave  is  entering  from  WAN. 

•  reject  or  admit  unprotected  unclassified  messages  according  to  the  local 
policy.” 
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9.1  Omission  in  CONOP 


An  important  omission  is  present  in  [14]  p.23,  because  message  headers  carry  information  about 
individual  messages,  not  enclaves.  Therefore,  SNS  has  no  way  of  determining  the  security  level  of 
the  enclave  which  originated  the  incoming  message,  unless  such  information  is  stored  in  SNS.  SNS 
can  be  easily  configured  to  “know’’  the  security  level  of  the  enclave  it  belongs  to.  Security  levels  of 
ail  other  enclaves  would  have  to  be  stored  in  some  table  within  SNS.  Such  tables  would  be  analogous 
to  routing  tables  of  Internet  gateways.  However,  enclave  security  level  can  change  in  time,  and  there 
has  to  be  a  way  to  update  the  tables  of  all  SNSs.  Both  the  presence  of  tables  and  updating  process 
need  to  be  included  in  SNS  specification,  or  local  Directory  specification.  We  think  that  SNS  or 
local  DU  A  would  be  the  most  logical  place  to  include  enclave  security  level  information. 

This  omission  probably  happened  because  CONOP  includes  only  example  of  messaging  between 
Secret  High  enclaves. 

Enclaves  may  consist  of  workstations  of  various  security  levels:  “A  Secret  System  High  enclave 
contains  at  least  one  FORTEZZA  protected  user  workstation.  It  may  also  contain  workstations  that 
are  not  FORTEZZA  protected,  ..  ”  [14]  p.20.  MISSI  local  policy  depends  on  enclave  security  level, 
as  we  discussed  in  detail  in  section  10. 

[14]  Appendix  D,  states:  “When  MSP  encrypted/signed  messages  are  processed  at  SNS,  the  SNS 
will  remove  its  token  from  the  message  prior  to  delivering  the  message.  If  a  sequence  signature  has 
been  applied  to  the  message  by  the  originator,  then  this  outer  signature  is  invalidated  by  the  removal 
of  SNS  token. 

Thus,  the  operational  scenarios  assume  that  if  the  MSP  optional  sequence  signature  is  selected 
by  the  originator,  then  the  use  of  a  sequence  signature  on  that  specific  message  must  be  preserved 
throughout  the  transmission  from  writer  to  reader. 

Thus,  SNS  must  generate  a  new  sequence  signature  for  the  message  in  order  to  preserve  the  use 
of  sequence  signature.”  Is  this  SNS  on  sender's  or  receiver's  side? 

“Other  assumptions  are:  X.400  services  will  not  be  used  to  transfer  a  X.400  P772  message  from 
the  workstation  to  the  SNS.  SNS  has  MTA  functions  that  can  transfer  messages  (for  example,  PI 
envelope)  out  of  the  enclave.  These  assumptions  are  based  on  understanding  that  current  MTA 
products  do  not  completely  support  the  MTA  functionality  (e.g.  P3  submit  envelopes)  specified  in 
X.400.  With  the  MTA  functionality  in  the  SNS,  multiple  MTAs  will  not  be  required  in  the  enclave.” 

As  an  outbound  filter,  i.e.  when  checking  messages  which  are  leaving  the  (Secret)  enclave,  SNS 
will  pass  messages  according  to  the  following  criteria  [14]  p.73;  all  S  messages  must  be  encrypted; 
all  messages  must  be  signed. 

SNS  will  also  check  if  the  message  originator  is  allowed  to  send  messages  with  security  level  lower 
than  the  enclave  security  level. 

As  an  inbound  filter,  i.e.  when  checking  messages  which  are  entering  the  enclave,  SNS  will 
accept  messages  according  to  one  of  the  following  three  policies  [14]  p.79:  messages  must  be  signed: 
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or  messages  must  be  encrypted;  or  any  messages  are  accepted. 


10  Relationship  Between  MISSI  Certificates,  Workstations, 
Message  Security  Levels  and  User  Authority  Properties 

In  order  to  formalize  MISSI  access  policy,  we  put  together  a  more  formal  definition  of  terms  and 
rules  scattered  throughout  MISSI  documentation. 

The  term  FFC  means  “FORTEZZA  for  Classified.” 

Each  user  has  one  PIN  number  that  allows  access  to  his  FORTEZZA  card. 

Each  user,  or  PIN,  has  several  personalities. 

Each  personality  has  its  own  DN  and  certificate  stored  on  the  card. 

The  relationship  between  user,  Card,  personalities  and  certificates  is  shown  in  Fig.  7. 


PERSONALITY  — ^  DN-—  CERTIFICATE 


USER  —  PIN  —  CARD---^  PERSONALITY  —  DN—  CERTIFICATE 


‘  PERSONALITY  —  DN—  CERTIPTCATE 


Figure  7:  Relationship  Between  User  and  His  Personalities 

When  a  user  logs  into  his  Card  using  a  chosen  personality,  the  Card  will  perform  all  operations 
from  that  point  on  using  the  certificate  associated  with  the  chosen  personality. 

Once  a  user  is  logged  into  Card,  he  can  change  personalities  any  time  during  a  session. 

User  can  logout  and  re- login  to  the  Card  without  logging  out  of  the  host  [4],  p.2.  This  require¬ 
ment  could  imply  that  a  non-FORTEZZA  user  can  login  into  a  workstation  and  use  the  workstation 
as  a  plain-vanilla  workstation,  if  there  is  no  additional  mechanism  to  restrict  workstation  use  to 
FORTEZZA  users  only.  This  could  be  a  serious  security  hole  in  case  of  Secret  workstations. 

There  are  separate  certificates  for  SBU  and  S  personalities.  SBU  and  S  certificates  are  distin¬ 
guished  by  cryptographic  universals  used  to  create  the  certificate.  X.509  certificates  for  Secret  are 
only  capable  of  encrypting  messages  for  users  also  possessing  an  X.509  certificate  for  Secret  [14]  p.28. 
Therefore,  SBU  certificate  cannot  be  used  to  decrypt  information  encrypted  by  S  certificate,  and  vice 
versa. 

The  latest  CON  OP  [14]  is  not  very  precise  in  terms  of  certificate  requirements.  It  says:  “If  a  S 
user  requires  the  capability  to  originate  traffic  destined  for  an  SBU  user  outside  of  the  S  enclave  or 
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to  receive  messages  from  an  SBU  only  user,  then  the  user  FORTEZZA  card  must  host  two  X.509 
certificates:  one  for  encrypting  S  traffic  and  one  for  encrypting  SBU  traffic.”  “SBU/Secret  Dual 
Mode  Card  contains  one  or  more  X.509  certificates  for  Secret  as  well  as  one  or  more  X.509  certificates 
for  SBU.  This  card  is  capable  of  originating  and  receiving  both  Secret  and  SBU  messages  depending 
on  the  classification  of  the  data  within  the  message  and  the  capabilities  of  message  recipient.” 
We  assume  that  this  means  that  ihe  S  user  must  have  two  personaliiies:  S  and  SBU,  and  switch 
personalities  when  he  or  she  needs  to  send  or  receive  encrypted  messages.  Perhaps  this  is  why  a  card 
allows  personality  switch  within  a  login  session.  Another  interpretation  of  ihe  above  requirement 
would  be  that  the  dual  mode  card  automatically  switches  personalities  when  needed;  for  example,  a  S 
user  receives  a  message  from  SBU  user,  so  the  receix^er^s  card  automatically  uses  the  SBU  certificate 
associated  with  ihe  S  user  -  but  this  interpretation  does  not  make  sense  from  the  security  standpoint. 

“Since  the  universal  within  the  signature  component  of  the  FFC  X.509  certificate  is  the  same 
as  in  the  certificate  for  Secret  and  certificates  for  SBU,  users  with  only  one  certificate  (e.g.  DSA 
with  only  a  certificate  for  SBU)  are  capable  of  verifying  signatures  of  directory  accesses  using  either 
the  certificate  for  Secret  or  the  certificate  for  SBU.”  [14]  p.28.  This  requirement  does  not  make 
sense  if  read  literally,  because  it  says  that  a  user  possessing  only  SBU  certificate  can  use  either  SBU 
or  S  certificate  -  but  it  is  stated  that  the  user  has  only  SBU  certificate  and  therefore  cannot  use  S 
certificate  which  he  does  not  posses.  However,  if  we  try  to  interpret  this  requirement,  we  can  assume 
that  it  means:  Users  with  only  one  certificate  are  capable  of  verifying  signatures  signed  using  either 
S  or  SBU  certificate.  Therefore,  signatures  signed  using  S  or  SBU  certificates  can  be  verified  using 
either  S  or  SBU  certificates.  In  other  words,  S  personality  can  read  SBU  signature,  and  vice  versa. 

“Certificate  for  S  has  S,  SBU  and  U  privileges,  and  certificate  for  SBU  has  SBU  and  U  privileges” 
[14]  p.61.  We  think  that  this  statement  means  that  personalities  with  S  certificates  are  allowed  to 
send  messages  classified  as  S,  SBU  or  U;  and  personalities  with  SBU  certificates  are  allowed  to  send 
messages  classified  ais  SBU  or  U. 

There  are  separate,  dual  certification  hierarchies  for  S  and  SBU  certificates.  Therefore,  there  are 
PAA,  PCAs  and  CAs  for  SBU  certificates,  and  PAA,  PCAs  and  CAs  for  S  certificates  [13]  p.l6. 

There  are  several  classes  of  FORTEZZA  cards:  SBU  only,  which  contain  only  SB  certificates;  S 
only,  which  contain  only  S  certificates;  dual,  which  contain  SBU  and  S  certificates;  SBU  traveling, 
and  compartmented  only  card.  Both  dual  and  Secret  only  cards  are  called  FFC  cards  [14]  p*4/. 

Each  workstation  has  its  own  level  of  security  level  (we  conclude  this  from  SDN. 701,  p.8).  We 
assume  this  security  level  must  be  based  on  which  FORTEZZA  card  reader  is  connected  to  the 
workstation:  U  if  there  is  no  card  reader,  SBU  if  there  is  FORTEZZA  for  Sensitive  But  Unclassified 
Card  reader  (FFS),  and  S  if  there  is  FORTEZZA  for  Classified  (FFC)  Card  reader.  We  see  from 
[14]  p.46  that  CA  workstation  can  have  up  to  three  PCMCIA  slots:  one  slot  for  the  Dual  Use 
Cards  (S/SBU),  one  slot  for  S  Cards  only,  and  one  slot  for  SBU  Cards  only.  We  assume  that  other 
workstations  have  the  same  configuration. 
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Enclave  security  level  is  equal  to  the  highest  security  level  of  the  workstations  which  form  the 
enclave.  An  enclave  can  contain  workstations  of  lower  security  level.  For  example,  an  enclave  is 
proclaimed  Secret  if  it  contains  at  least  one  FFC  user  workstation. 

A  Secret  enclave  can  contain  FFS  workstations  as  well  as  workstations  with  no  FORTEZZA 
Card  readers.  We  conclude  this  from  the  sentece:  “A  Secret  System  High  enclave  contains  at 
least  one  FORTEZZA  protected  user  workstation.  It  may  also  contain  workstations  which  are  not 
FORTEZZA  protected,  ..”  [14]  p.  20. 

Currently,  we  do  not  see  anywhere  in  MISSI  documents  that  workstations  are  aware  what  is  the 
security  level  of  the  enclave  they  are  in. 

Each  personality  is  free  to  log  into  any  workstation  which  is  connected  to  a  FORTEZZA  Card 
reader  of  the  level  higher  or  equivalent  to  the  personality’s  security  level.  For  example,  S  personality 
cannot  login  into  an  SBU  capable  card  reader.  (Because  S  and  SBU  certificates  are  different). 

We  assume  that  a  card  reader  of  security  level  X  can  read  cards  of  security  level  X  and  below. 

It  follows  that  a  Secret  personality  must  use  a  FFC  capable  workstation,  therefore  it  must  operate 
from  a  Secret  enclave. 

[14]  specifies  an  access  policy  between  users  in  Secret  enclaves  and  users  at  other  enclaves.  An 
example  is:  “Between  Secret  enclaves:  signed  and  encrypted  unclassified  through  Secret  (U-S)  mes¬ 
sages  originated  by  users  at  FFC  workstations  in  Secret  enclaves  for  recipients  at  FFC  workstations 
in  Secret  enclaves.”  This  requirement  is  ambiguous.  The  questions  we  need  to  ask  are: 

•  the  word  “user”  should  be  replaced  with  the  word  “personality.” 

•  Is  it  possible  to  log  into  a  workstation  if  a  user  is  not  a  FORTEZZA 
user,  or  has  authorization  level  less  than  the  Card  reader  level  for  that 
workstation?  That  is,  can  a  SBU  personality  use  the  workstation  which 
has  FFC  Card  reader  connected  to  it?  If  it  is  possible,  than  the  above 
requirement  must  be  modified.  The  above  requirement  is  valid  only  if 
Secret  personalities  exclusively  can  use  a  FFC  workstation. 

From  the  MISSI  documentation,  it  is  not  clear  if  the  above  holds.  How¬ 
ever,  from  talking  with  Chet  Hossmer  from  ORA,  who  worked  on  MISSI 
implementation,  the  above  requirement  holds;  therefore,  the  CONOP  is 
incorrect. 

Therefore,  we  conclude  that  MISSI  access  policy  has  to  be  developed  as  a  matrix  which  takes  into 
account  security  levels  of:  enclave,  workstation,  and  personality  currently  used,  as  shown  in  Fig.  9. 
The  matrix  needs  to  be  filled  with  the  allowed  message  characteristics:  message  classification  and 
security  service  applied  to  the  message.  We  filled  the  matrix  with  a  security  policy  we  proposed  and 
tested  in  our  model.  We  summarized  various  security  levels  in  section  11. 


3-21 


MESSAGE  SECURITY  LEVEL 


SECURITY  SERVICE 
APPLIED 

U 

SBU 

S 

Vanila  workstation 

UNPROTECTED 

(UP) 

"  y 

12 

7 

13 

X 

SIGNED  ONLY 
(SO) 

■'y 

“  y 

23 

X 

FOTREZZA  workstation 

SIGNED  AND 
ENCRYPTED 
(SE) 

31 

y 

32 

y 

33 

vanila  user 

SBU  user 


S  user 


POSSIBLE  COMBINATION 

X  :  IMPOSSIBLE  (I  E.  FORBIDDEN)  COMBINATION 
f?  NOT  CLEAR  IF  IT  IS  A  POSSIBLE  COMBINATION 


Figure  8:  MISSI  Message  Security  Classifications 

Therefore,  we  formalize  the  acces  policy  as  follows:  Any  personality  of  security  level  X  is  allowed 
to  send  email  to  any  other  personality  of  security  level  Y,  if: 

♦  the  messages  sent  are  classified  at  the  security  level  Z,  where  Z<min(X,Y); 
and 

•  the  messages  sent  have  security  service  VV  applied  to  them,  where  W  is 
determined  as: 

-  we  do  not  know  if  SBU  enclaves  are  allowed  to  send  unprotected 
messages  to  the  WAN  -  this  is  not  discussed  in  [14]. 

~  all  messages  classified  as  Secret  must  be  signed  and  encrypted;  i.e. 
W=signed+encrypted  if  msgJevel==S. 

-  Secret  enclaves  cannot  send  unprotected  messages. 

-  Secret  enclaves  cannot  receive  unprotected  messages.  That  is,  W>signed 
if  wkst Jevel==S.  In  other  words,  Secret  enclaves  must  send  and  re- 
ceive  signed  or  signed-and-encrypted  messages.  This  requirement  is 
enforced  by  SNS  filtering  [14].  p.ll. 
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This  definition  can  cause  problems.  For  example,  a  SB  U  personality 
can  be  working  from  a  Secret  enclave,  from  a  SB U  capable  worksia- 
tion.  Currently,  we  do  not  see  anywhere  in  MISSI  documents  that 
workstations  are  aware  what  is  the  security  level  of  the  enclave  they 
are  in.  This  personality  could  send  an  unprotected  unclassified  mes¬ 
sage  to  a  user  on  a  vanilla  enclave  or  workstation. 

We  conclude  that  all  workstations  in  the  Secret  enclave  must  have  the 
same  UA  implementation,  which  does  not  allow  unsigned  messages. 
However,  in  a  Secret  enclave  we  can  have  vanilla  workstations  and 
non-MISSI  users,  which  can  send  unprotected  messages.  The  only 
way  to  prevent  non-protected  messages  leaving  a  Secret  enclave  is  to 
have  the  Guard  know  enclave  level  and  prevent  unauthorized  messages 
from  leaving  the  enclave. 

[11],  p.l4,  specifies  that  “UAs  are  grouped  into  classes  based  on  the 
type  of  content  of  messages  they  can  handle.  The  MTS  provides  a 
UA  with  the  ability  to  identify  its  class  when  sending  messages  to 
other  UAs.” 

This  requirement  means  that  users  at  vanilla  workstations  in  Secret 
enclaves  cannot  send  anything  outsideHhe  enclave  (provided  that 
SNS  filtering  works). 

“  vanilla  workstations  can  receive  only  unprotected  messages,  because 
they  do  not  have  any  FORTEZZA  capability. 

The  last  two  requirements  clash,  because  if  a  Secret  enclave  sends 
a  signed  message  to  an  unprotected  enclave,  the  unprotected  enclave 
cannot  verify  the  signature,  because  an  unprotected  enclave  does  not 
have  a  Card  reader  yet  it  needs  msp.check-sign  function  from  the 
Card  to  verify  the  signature. 

An  old  version  of  CONOP  proposes  that  secret  enclaves  can  send  un¬ 
protected  messages  to  SNS,  which  will  archive  the  message  content 
for  future  audit.  The  new  CONOP  [14]  addresses  this  issue  implic¬ 
itly.  It  assumes  that  a  Secret  enclave  sends  a  signed  message  to  an 
unprotected  workstation,  and  that  this  message  consists  of  two  iden¬ 
tical  parts,  one  with  the  signature  and  another  with  no  protection. 
SNS  will  verify  the  signature  and  assure  that  the  two  message  parts 
are  identical  [If]  p.7.  72.  We  conclude  that,  in  this  case,  the  SNS 
on  the  receiver's  side  must  know  the  capabilities  of  the  recipient,  and 
remove  the  signature  if  the  recipient  has  no  Card  reader. 


3-23 


The  above  discussion  brings  us  to  formalize  one  additional  require¬ 
ment: 

-  secret  enclaves  can  communicate  with  vanilla  workstations  only  if  the 
vanilla  workstations  are  on  an  enclave  with  SNS. 

Currently,  only  Secret  enclaves  are  obliged  to  have  SNS. 

If  a  Secret  workstation  sends  a  signed  message  to  a  vanilla  worksta¬ 
tion  which  is  not  in  an  enclave  with  SNS,  the  receiver  can  be  used 
to  collect  messages  from  the  secret  enclave,  which  presents  a  security 
hole. 

Secret  Messages  between  a  Secret  personality  and  any  other  personality  must  be  signed  and 
encrypted.  SBU  Messages  between  a  Secret  personality  and  any  other  personality  are  either  signed 
and  encrypted,  or  signed  only. 

11  Security  Levels 

In  this  section  we  will  clarify  various  security  levels:  enclave,  card,  workstation,  user,  certificate,  and 
message. 

11.1  Enclave  Security  Level  ^ 

If  an  enclave  E  consists  of  workstations  Wl,  W2,  ...  Wn,  and  the  workstations  have  security  levels 
SI,  S2,  ...  Sn  respectively,  then  the  enclave  E  has  security  level  SE,  where  SE=max(Sl,  S2,  ...  Sn). 

11.2  Workstation  Security  Level 

Workstation  security  level  is  determined  by  the  capability  of  the  Card  reader  attached  to  the  work¬ 
station.  [14]  mentions  different  “slots”  on  card  readers.  Each  “slot”  accepts  Cards  of  certain  security 
level,  a  Card  reader  can  have  the  following  slots:  a  slot  capable  of  reading  only  SBU-only  Cards, 
a  slot  capable  of  reading  only  S-only  Cards,  and  a  slot  capable  of  reading  Dual-use  SBU/S  cards 
(these  Card  readers  are  called  Dual  SBU/S  Card  readers).  From  the  wording  used  in  [14]  p.43  and 
p.46,  it  seems  that  a  card  reader  with  a  Dual  slot  can  have  another  slot  used  only  for  S-only  or 
SBU-only  cards. 

11.3  Card  Security  Level 

Card  security  level  is  determined  by  the  security  level  of  certificates  and  cryptographic  and  signature 
verification  material  stored  on  the  Card. 
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There  are  three  kinds  of  cards;  S-only,  SBU-only,  and  Dual  S/SBU.  S-only  Cards  contain  only  S 
certificates  and  materials,  SBU-only  Cards  contain  SBU  certificates  and  materials,  and  Dual  S/SBU 
Cards  contain  both  S  and  SBU  certificates  and  materials. 

Each  card  belongs  to  a  particular  user.  A  user  can  own  several  certificates,  which  are  stored  on 
the  user’s  Card.  The  user  unlocks  the  Card  by  supplying  his  PIN. 

11.4  Certificate  Security  Level 

There  are  two  kinds  of  certificates:  SBU  and  S.  Certificate  for  S  has  S,  SBU  and  S  privileges  (i.e. 
can  perform  security  services  on  messages  classified  as  S,  SBU  or  U),  and  certificate  for  SBU  has 
SBU  and  U  priviledges. 

Certificates  are  also  classified  as  organizational  or  personal. 

Based  on  certificate  security  and  organizational  classification,  the  user  can  or  cannot  perform 
certain  operations,  based  on  local  security  policy  and  other  MISSI  requirements. 

SBU  and  S  certificates  have  different  cryptographic  material,  so  the  sender  and  the  receiver  must 
use  exclusively  either  S  or  SBU  certificates  in  order  to  communicate  encrypted  messages. 

SBU  and  S  certificates  have  the  same  signature  verification  material,  so  SBU  and  S  certificates 
are  interchangeable  for  signature  verification. 

11.5  User  Security  Level  - 

User  security  level  is  determined  during  a  MISSI  session  by  the  type  of  Card  this  user  possesses  and 
the  certificate  the  user  is  currently  using.  Each  user  can  have  several  identities,  called  personalities, 
where  each  personality  owns  a  certificate  and  has  a  certain  security  level  and  organizational  status 
assigned  to  it.  For  example,  Joe  Schmoe  can  have  a  Card  with  two  certificates:  S  organizational 
certificate  as  Joe  Schmoe  the  Division  Director,  and  SBU-only  personal  certificate  as  Joe  Schmoe  an 
employee.  Certificates  are  stored  on  the  users  personal  Card  (and  unlocked  via  the  user’s  personal 
PIN).  Each  user  can  switch  personalities  during  a  MISSI  session. 

11.6  Message  Security  Level 

Message  security  level  is  determined  by  the  local  security  policy.  Either  the  user  is  allowed  to  assign 
message  security  level,  or  UA  assigns  it  based  on  the  local  security  policy.  For  example,  the  local 
UA  can  assign  the  message  security  level  based  on  the  workstation  or  enclave  security  level  [14]  p.59. 

ACP  123  specifies  that  each  military  message  must  have  (a  clearly  displayed)  security  classifica¬ 
tion.  [16]  p.4-9  determines  the  message  security  label  as;  “The  security  label  assigned  to  the  entire 
message  will  be  that  of  the  highest  classification  of  any  part  of  the  message  or  the  appropriate  label 
for  the  aggregate  of  the  information  contained  in  the  entire  message  including  all  body  parts.”  Each 
body  part  can  be  a  forwarded  message. 
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11.6.1  Body  Parts 

If  a  message  M  consists  of  message  text  T  and  several  body  parts  Bl,  B2,  ...  Bn,  and  if  the  user 
assigns  security  level  ST  to  the  message  text,  and  body  parts  have  security  levels  SI,  S2,  ...  Sn 
respectively,  then  the  message  M  has  security  level  SM,  w^here  SM=max(ST,  Si.  S2,  ...  Sn). 

An  MM  body  part  can  contain  forwarded-MM  and  forwarded-IP  messages. 

12  SPIN  Specification  of  MISSI 

In  order  to  utilize  model  checkers,  it  is  necessary  to  have  a  manageable  number  of  states,  which  leads 
to  attempts  to  save  on  the  number  of  states.  There  are  several  ways  to  reduce  number  of  states;  to 
bundle  several  consecutive  execution  steps  in  one  unit  using  atomic  statements;  to  reduce  channel 
sizes,  counters,  assert  and  other  statements,  and  number  of  variables  in  general;  and  to  make  an 
abstraction  of  the  system  and  focus  on  the  areas  of  interest  only. 

We  employed  all  these  approaches  in  our  modeling.  Our  model  consists  of  modules,  where  we 
abstract  the  modules  which  are  less  important  by  their  input-output  function,  and  focus  on  the 
modules  of  interest. 

Currently,  we  have  three  different  models  of  MISSI:  [6]  specifies  a  detailed  model  of  MISSI  sender 
with  an  abstraction  of  local  cashe  and  certificate  verification;  [7]  also  specifies  the  sender,  but  focuses 
on  the  local  cashe  and  certificate  verification,  and  abstracts  away  the  processes  necessary  to  prepare 
a  message  for  sending.  In  this  report,  we  abstract  away  the  sending  process,  and  focus  on  the 
receiver. 

13  Specification  of  MISSI  Receiver  and  Forwarding 

We  model  a  network  consisting  of  two  workstations  connected  to  the  WAN  via  SNS,  as  shown 
in  Figure  Fig.  10.  We  specify  receiver  algorithm  and  incorporate  descriptions  of  SNS  and  MSP 
functions  as  outlined  in  sections  9  and  7, 

Workstation  A  is  the  sender,  and  B  is  the  receiver.  We  simplified  the  sender,  in  order  to  eliminate 
unnecessary  states.  We  have  a  detailed  model  of  the  sender  and  message  verification  process  in  [6] 
and  [7].  In  this  report,  sender  model  details  only  the  structure  needed  to  assign  message  classification 
and  security  services  and  chose  recipients.  We  assume  that  the  message  passes  verification  tests  and 
preparation  for  sending. 

On  the  receiver  side,  w’e  assume  that  the  message  passes  verification  tests,  and  we  outline  the 
steps  needed  to  receive  the  message. 

We  have  two  receiver  models:  in  the  first  model,  which  we  called  Model  1,  we  assume  that  the 
sender  can  forw’ard  any  message  it  has  stored  previously.  In  Model  2,  we  assume  that  the  receiver 
can  forward  any  message  that  the  sender  has  sent.  We  had  to  split  the  forwarding  capabilities  into 
two  models  in  order  to  reduce  state  space  and  be  able  to  run  the  model.  We  did  merge  capabilities  of 
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Model  1  and  Model  2  together,  but  this  composite  model  would  always  run  out  of  memory,  because 
the  network  channels  are  quite  large  with  the  addition  of  a  forwarded  message  (which  adds  three 
fields  to  the  channels).  Therefore,  we  decided  to  split  the  capabilities. 

We  took  several  additional  measures  to  reduce  number  of  states.  We  “hard- wired”  the  two 
enclaves  together,  in  order  to  eliminate  routing  states.  We  did  not  model  calling  any  of  the  MSP 
functions  except  mspjstatus  and  mspjdeliver,  which  are  needed  for  receiving.  We  assume  that 
all  other  functions  are  called  and  are  executed  with  no  errors.  We  include  them  in  the  model  as 
comments.  In  Model  2,  we  do  not  include  any  MSP  functions,  not  even  mspjstatus  and  msp-deliver. 
We  do  model  them  in  Model  1,  Model  1  was  at  the  limit  of  available  memory  space.  SPIN  would 
ran  out  of  memory  if  we  tried  to  compile  the  model  as  pan  ‘“w25  “"m400000,  i.e.  to  limit  the  number 
of  states  to  2^®.  We  did  compile  the  model  with  -w28,  which  is  the  upper  limit  on  a  workstation 
with  320Mb  of  RAM. 

14  Variables 

We  use  the  following  variables,  with  their  values  in  parenthesis: 

wkst^capability:  workstation  security  level  (S,  SBU,  S,  or  Dual) 
enclaveJLevel:  enclave  security  level  (S,  SBU,  or  U) 

pers:  personality  used;  also  what  type  of  certificate  was  used  (organizational  SBU  and  S  person¬ 
alities/certificates  are  called  ORG_SBU_PERS  andT)RG-S_PERS;  individual  SBU  and  S  personali¬ 
ties/certificates  are  called  SBU-PERS  and  S-PERS;  and  non-MISSI  user  with  no  certificate  is  called 
UJ^ERS) 

to:  personality  of  the  receiver 
from:  personality  of  the  sender 

The  following  variables  refer  to  the  overall  message  sent/received: 
msg_classilication:  message  security  level  (S,  SBU,  or  S) 
serv^el:  security  services  applied  to  the  message  (NONE,  SIGN,  ENCRYPT) 
orgjnsg:  organizational  message  flag 

The  following  variables  refer  to  the  message  body  part  that  contains  forwarded  message: 
f  orwardedjnsg-clas:  message  security  level  (S,  SBU,  or  S) 

forwardedjservjsel:  security  services  applied  to  the  message  (NONE,  SIGN,  ENCRYPT) 
f  orwcirdedjorg-msg:  organizational  message  flag 

15  Modeling  Messages 

Since  SPIN  cannot  pass  arrays  as  channel  parameters,  we  modeled  message  encapsulation  and  “peel¬ 
ing”  of  message  headers  indirectly.  We  passed  header  fields  of  the  message  as  channel  arguments, 
and  assumed  that  those  arguments  which  are  “hidden”  are  not  accessible  because  the  message  was 
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not  stripped  to  that  part  yet.  Thos  earguments  which  are  ‘‘visible”  are  those  which  are  in  the  header 
that  is  currently  used. 

For  example,  the  user  sends  message  MSG,  which  contains  body  parts  BODYl  and  BODY2. 
We  add  header  HDR  to  the  message.  The  header  contains  fields  MSG.CLAS,  ORG-MSG,  and 
SERV-5EL.  MSG  and  HDR  together  make  an  MSP  message.  This  message  is  enveloped  by  X.400 
header  X400JIDR,  which  contains  FROM  and  TO  fields,  and  the  entire  MSP  message  becomes  the 
body  of  the  X.400  message.  We  would  like  to  model  sending  of  this  X.400  message  as: 

to.net !  X400.HDR ,  X4003ody 

where  X400.body  would  be  an  array  consisting  of  arrays  HDR  and  MSG.  However,  this  is  not 
possible  in  SPIN.  Sending  of  this  X.400  message  will  be  modeled  as: 

to.netIFROM,  TO,  dl,  d2,  d3,  d4,  d5,  d6,  d7,  d8,  d9,  dlO,  dll 

FROM  and  TO  fields  are  "visible,  ”  i.e.  they  are  used  for  the  current  routing  and  other  processing, 
dl,  ...,  d7  are  “invisible”  dummy  variables  which  contain  the  parts  of  the  message  which  are  currently 
enveloped  and  unaccessible.  We  need  to  “peel  off”  the  headers  in  order  to  reach  dl,  d7.  dl 
represents  field  MSG-CLAS,  d2  represents  field  ORG-MSG,  d3  represents  SERV-SEL,  d4,  d5,  d6 
and  d7  represent  MSG-CLAS,  ORG-MSG,  SERV_SEL  and  BODY  parts  of  BODYl  (which  is  a 
complete  message  in  itself);  and  d8,  d9,  dlO  and  dll  represent  BODY2. 

16  SNS  Model 

The  outbound  messages  are  examined  as: 
if 

::  (d5  <=  from  &&  d5  <=  to)  ->  skip: 
if 

::  (d5  ==  S  kk  d6*=SIGN+ENCRYPT)  ~>  skip; 
if 

;  ;  (enclave^level  ==  S  kJt  d6  >=  SIGN)  ->  skip; 

:  :  ! (enclave.level  ==  S  ftk  d6  >=  SIGN)  ->  wrong  ++; 
printf  ("wrong=y,d\n"  ,  wrong);  goto  end^net; 

fi; 

::!(d5  ==  S  kk  d6==SIGN+ENCRYPT)  ->  wrong  ++; 

printf ("wrong=y*d\n" ,  wrong);  goto  end.net; 
f  i; 

: ; ! (dS  <=  from  kk  dS  <=  to  )  ->  wrong  ++; 

printf ("wrong=y,d\n",  wrong);  goto  end.net; 

fi; 

to.wanfs]  !from,  to,  fdbl,fdb,  fdS,  fd6,  db,  d5  ,  d6; 

Inbound  messages  are  examined  as: 
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to_¥an[r]?from,  to,  fdbl,fdb,  fd5,  fd6,  db,  d5  ,  d6; 
atomic  { 

if 

::  (enclave_l€vel  ~  S  d6  >=  SIGN)  ->  skip; 

: :  !  (enclave.level  ==  S  kk  d6  >=  SIGN)  ->  wrong  ++; 

printf ("wrong=7,d\n",  wrong);  goto  end.net; 
fi; 

from_net[wkst]  !from,  to,  fdbl.fdb,  fdS,  fd6,  db,  dS  ,  d6; 

} 


17  PRBAC  Model 

In  case  the  message  was  encrypted,  we  perform  PRBAC.  We  assume  that  LRBAC  will  pass. 

prbac ; 
if 

: :  (msg.classif  ication  <=  pers  kk  msg.classif ication  <=  from  )  ->  skip; 
if 

::  (  msg.classification  <-  wkst.capability)  ->  skip; 

::!(  msg^classification  <=  wkst^capability)  -> 

msg.classif  ication  =  wkst.capability ;  /♦upgrade*/ 

fi;  ^ 

: : !  (msg.classif ication  <=  pers  msg.classif ication  <=  from)  -> 

wrong  ++;  goto  exit_; 
f  i; 

18  Personality  Switching 

We  model  that  receiver  can  switch  personality  during  a  session,  in  order  to  receive  a  newly  arrived 
message: 

if 

::  (to  “  pers)  ->  skip; 

: : ! (to  ==  pers)  -> 
if 

: :  (wkst.capability  -=  U  kk  to  ==  U_PERS)  -> 

pers  *  to;  /♦switch  personalities*/ 

(wkst^capability  SBU  kk  (to  SBU^PERS  I  1  to  ==  ORG_SBU_PE!RS  I  I  to  ==  UPPERS))  •> 
pers  =  to; 

(wkst^capability  ==  S  kk  (to  «  S.PERS  1 1  to  ==  ORG^S.PERS  11  to  ==  UPPERS)  )  -> 
pers  =  to; 

; ;  (wkst. capability  -=  Dual)  -> 
pers  =  to; 
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: :  !  (  (wkst. capability  ==  U  fcft  to  =*  UPPERS) 

II  (wkst.capability  ==  SBU  kk  (to  ==  SBU^PERS  II  to  ==  ORG^SBU.PERS  II  to  »=  UPPERS)) 

I  I  (wkst.capability  *=  S  kk  (to  *=  SUPERS  11  to  *=  ORG.S.PERS  I  I  to  «»  UPPERS)  ) 

II  (wkst^capability  ==  Dual))  -->  ¥rong++; 

assert (received.mail  ==  sent .mail  II  wrong  1); 
goto  exit.; 

fi; 

fi; 

19  Model  1 

In  Model  1,  we  assume  that  the  sender  can  pick  up  any  stored  message  and  forward  it.  This  is 
equivalent  to  forwarding  a  message  from  Pine  mailbox. 

if 

::  skip;  /*do  not  forward  anything^/ 

forward  =  1;  /♦pick  and  send  any  existing  message^/ 
if 

::  forwarded.msg.clas  *  S; 

::  f orwarded.msg.clcLs  -  SBU; 

::  forwarded.msg.clas  =  U; 

fi;  ^ 

if  /♦this  should  not  be  possible  for  U.PERS  or  wkst.level  =U  ♦/ 

::  forwarded.org.msg  =  1; 
forwarded.org.msg  =  0; 

fi; 

if 

::!(pers  ==  U.PERS)  -> 

if  /♦this  should  not  be  possible  for  U.PERS  or  wkst.level  «U  ♦/ 

::  f orwaxded.serv.sel  =  ENCRYPT  +  SIGN; 
f orwarded.serv.sel  =  SIGN; 
f orwarded.serv.sel  ~  0; 
fi; 

: :  (pers  ==  U.PERS)  ~>  skip; 
fi; 
fi; 

20  Model  2 

In  Model  2,  we  assume  that  receiver  is  forwarding  a  freshly  received  message.  The  receiver  is 
operating  from  personality  pers. 

if 

: :  (pers  »=  ORG.SBU.PERS  I  I  pers  *=  ORG.S.PERS  )  -> 
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->  forwarded.org.msg  =1; 

/♦organizational  users  can  send  only  organizational  msgs*/ 

:;(l(pers  ==  ORG^SBU.PERS  II  pers  —  ORG^S.PERS)  &&  org.msg  *“  0)  -> 
forwarded.org.iosg  =0; 

: :  CASEl  or  CASE2 
fi; 

/♦  CASE  1:  LIMITED  FORWARDING  OF  ORGANIZATIONAL  MESSAGES^/ 

/♦assume  that  non-org  user  CANNOT  forward  org  msgs^/ 

::(!(pers  =*  ORG.SBU^PERS  II  pers  ==  ORG.S^PERS)  &&  org^msg  !=  0)  -> 
goto  f inish.receiving; 

/♦  CASE  2:  UNLIMITED  FORWARDING  OF  ORGANIZATIONAL  MESSAGES^/ 

;:(!(p€rs  ==  ORG.SBU.PERS  II  pers  ==  ORG^S.PERS)  &&  org.msg  !=  0)  ~> 
if 

: :  (f  orwarded.serv^sel  ==  0)  ->  f  orwarded.org.msg  =  0; 

/♦since  the  forwarded  msg  is  not  signed,  it  will  not  be 
recorded  as  organizational^/ 

: :  ! (f orwarded.serv.sel  ==  0)  ->  f orwarded^org.msg  =  1; 

fi; 

21  Results 

We  ran  our  SPIN  model  on  an  Indigo-2  with  320Mb  of  RAM.  The  model  took  several  seconds  to 
execute. 

If  we  forward  a  message,  we  assert  that  the  resulting  message  classification  must  be  greater  that 
the  individual  parts: 

assert (forwarded^msg.clas  <=  msg.classif ication) ; 

If  we  forward  a  message,  we  assert  that  the  resulting  message  must  be  organizational  if  the 
original  was  organizational: 

assert (forwarded.org.msg  <=  org^msg) ; 

We  also  keep  counters  sent-mail  and  receivedjnail,  which  are  incremented  every  time  mail  is 
send  or  received.  In  case  SNS  or  the  receiver  rejects  a  message,  we  increment  counter  wrong.  We 
assert  that  either  we  receive  correctly,  or  the  message  is  lost: 

assert ( (received^mail  ==  sent_mail  &&  wrong  ==  0) 

II  (received_mail  ==  sent.mail  &&  wrong  ==  1)); 

indigo  >  spin  -a  main.spinlC 
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indigo  >  gcc  -DMEMCNT=28  -DBITSTATE  -DSAFETY  -o  pan  pan. c 
indigo  >  pan  -w24  -nSOOOOO 

We  ran  our  model  for  Model  1,  where  only  sender  can  forward,  and  for  Model  2,  where  only 
receiver  can  forward. 

21.1  Model  1 

All  assert  statements  are  true. 


indigo  >  psLH  *'W28  '"in400000 

(Spin  Version  2.9.4  --  4  Hovember  1996) 

+  Peortial  Order  Reduction 

Bit  statespace  search  for: 

never~claiin  ~  (none  specified) 

assertion  violations  + 

cycle  checks  "  (disabled  by  -DSAFETY) 

invalid  endstates  + 

State-vector  536  byte,  depth  reached  102,  errors:  0 
8.28066e+06  states,  stored 
1.30117e+06  states,  matched 
9.58183e+06  transitions  (=  stored+matched) 

7.67589e+06  atomic  steps 

hash  factor:  32.4172  (expected  coverage:  >=  98'/.  on  avg.) 

(max  size  2*28  states) 

4.44963e+09  equivalent  memory  usage  in  bytes  (stored*vector  +  stack) 
7.83458e+07  actual  memory  usage 

unreached  in  proctype  Guard 

line  67,  state  47,  "-end-" 


21.2  Model  2:  Unlimited  Forwarding  of  Organizational  Messages 

If  we  allow  unlimited  forwarding  of  organizational  messages,  assert  (forwardedx)rgjnsg  <=  orgjnsg) 

does  not  hold. 
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For  example,  assume  that  an  organizational  user,  let  us  call  him  A,  sends  a  signed  and  encrypted 
message  to  a  non-organizational  user  B.  B  saves  the  original  encrypted  message  and  tries  to  forward 
it.  Let  us  represent  the  original  message  as  OM.  B  constructs  a  new  message  NM,  which  will  be  used 
to  forward  OM.  Since  B  is  not  an  organizational  user,  he  cannot  mail  an  organizational  message; 
therefore,  we  have  an  organizational  message  OM  embedded  in  a  non-organizational  message  NM. 
This  embedded  message  is  still  encrypted,  which  provides  some  protection  -  unless  B  sends  his  key 
to  the  recipient  of  the  forwarded  message. 

The  same  scenario  could  have  happened  with  signed-only  original  organizational  message  OM. 

pan:  assertion  violated  (for»arded_org_Bisg<“org_nsg)  (at  depth  104) 
pan:  wrote  main. spinl6. trail 
(Spin  Version  2.9.4  —  4  November  1996) 

Warning:  Search  not  completed 

+  Partial  Order  Reduction 

Bit  statespace  search  for: 
never-claim 
assertion  violations 
cycle  checks 
invalid  endstates 

State-vector  448  byte,  depth  reached  107,  errorgj  1 
331578  states,  stored 
62288  states,  matched 
393866  transitions  (=  stored+matched) 

648730  atomic  steps 

hash  factor:  50.5979  (expected  coverage:  >=  98y,  on  avg.) 

(max  size  2‘24  states) 

1.56947e+08  equivalent  memory  usage  in  bytes  (stored*vector  +  stack) 

1.26272e+07  actual  memory  usage 

21.3  Model  2  Correction:  Limited  Forwarding  of  Organizational  Mes¬ 

sages 

If  we  allow  limited  forwarding  of  organizational  messages,  assert  (lorwarded^rgjasg  <=  orgjnsg) 
holds. 

(Spin  Version  2.9.4  —  4  November  1996) 

+  Partial  Order  Reduction 

Bit  statespace  seairch  for: 

never-claim  -  (none  specified) 

assertion  violations  + 

cycle  checks  -  (disabled  by  -DSAFETY) 


-  (none  specified) 

+ 

-  (disabled  by  -DSAFETY) 
+ 
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invalid  endstates 


State-vector  448  byte,  depth  reached  109,  errors;  0 
433240  states,  stored 
85051  states,  matched 
518291  transitions  (=  stored+matched) 

855639  atomic  steps 

hash  factor;  38.7249  (expected  coverage:  >=  98%  on  avg.) 

(mauc  size  2“24  states) 

2.02492e+08  equivalent  memory  usage  in  bytes  (stored*vector  +  stack) 
1.26272e+07  actual  memory  usage 

unreached  in  proctype  Guard 


It  is  necessary  to  prohibit  non-organizational  users  from  forwarding  organizational  messages. 
Therefore,  we  propose  the  following  requirement:  Every  message  sent  must  have  the  following  prop^ 
eriy:  If  a  message  M  consists  of  message  text  T  and  several  body  parts  Bf  B2,  ...  Bn,  the  message 
M  is  assigned  organizational  status  iff  any  of  the  body  parts  has  organizational  status  and  it  is  pos¬ 
sible  to  assign  organizational  status  to  M.  If  it  is  not  possible  to  assign  organizational  status  to  M, 
the  message  cannot  be  sent.  ^ 

22  Conclusion 

In  our  experience,  English  specifications  are  ambiguous  and  incomplete.  Even  specifications  that 
sound  correct  can  lead  to  incorrect  results.  At  least,  English  specifications  should  contain  some  kind 
of  pseudO”Code  or  flow  diagram,  in  order  to  see  more  clearly  all  “if-then-else”  branchings  and  other 
possibilities. 

Protocol  specification  usually  relies  on  other  standards  and  protocols,  as  MISSI  relies  on  X.500, 
X.400,  ACP  123  and  SDN. 701,  and  the  interface  needs  to  be  clearly  specified. 

Formal  spefications  and  proofs,  augmented  with  textual  descriptions,  provide  a  very  clear  and 
detailed  documentation  for  the  system,  aid  in  understanding  of  the  system,  and  expose  weak  areas 
of  system  design.  Formal  specification  can  be  used  as  a  blueprint  for  implementation,  and  formal 
proofs  can  be  used  as  a  blueprint  for  implementation  testing. 

Formal  verification  assures  of  desired  system  properties.  Desired  system  properties  consist  of 
original  specification,  enhanced  by  additional  requirements  that  formal  specification  uncovers. 

Formal  specification  must  contain  some  implementation  bias.  Specifiers  must  be  careful  to  min- 
imize  it. 

Model  checkers  always  eventually  run  out  of  memory.  In  order  to  run  a  model,  we  must  divide  it 
in  several  sub-models,  and  validate  each  sub-model.  The  sub-models  can  represent  the  same  system. 
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where  each  sub-model  emphasizes  a  particular  feature  of  the  system  and  simplifies  the  other  features 
(e.g.  one  sub-model  focuses  on  login  and  logout,  another  on  sending  mail,  another  on  receiving  mail). 
The  sub-models  can  be  parts  of  a  bigger  model  (e.g.  each  submodel  is  a  layer  in  a  layered  protocol.) 
In  this  work,  we  used  the  first  approach. 

SPIN  is  useful  as  a  tool,  especially  because  it  resembles  C. 

Future  work  includes  further  formalization  of  the  required  properties  into  a  form  that  can  be 
machine  checked.  We  plan  to  use  either  first  or  higher  order  logic,  which  can  be  verified  using 
Penelope  or  HOL. 
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Abstract 


In  this  report,  the  performance  of  receivers  processing  designed  LPI  (Low  Probability  of  Intercept) 
signals  is  described.  Tlie  LPI  signals  involve  spread  spectrum  modulation  with  a  common  digital 
communication  structure  generated  by  a  so-called  flip-\\ave  signal.  The  signal  spectrum  is  spread 
by  direct  sequence  methods  while  using  various  tvpes  of  wave  shapes  involving  HBE  (High 
Bandwidth  Efficiency)  pulses  imposed  on  PN  (Pseudo  Noise)  spreading  codes.  Signals  arriving  at 
the  receiver  are  processed  (at  baseband)  by  a  D&M  (Delav'  and  Multiph  )  receiver  that  acts  as  the 
detector  of  spread  spectaim  signals.  Tlie  effect  of  sources  of  interference  on  the  receiver  such  as 
noise  and  channel  induced  multipath  are  evaluated.  A  post  D&M  receiver  processor  compares  the 
signal  of  a  squared  integral  operation  on  the  D&M  receiver  filtered  output  with  an  adjustable 
threshold  level  for  the  purpose  of  signal  presence  detection.  Performance  results  via  computer 
system  simulations  are  carried  using  a  block  oriented  simulation  software  package  knoum  as  Signal 
Processing  Workbench  (SPW). 
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1  Introduction 


1.1  Direct  Sequence  Spread  Spectrum 

The  so-called  spectrum  is  a  frequency-domain  representation  of  tlie  characteristics  of  a  system  or 
signal,  showing  the  operating  frequencies  or  the  bandwidth,  respectively.  Transformation  between 
time-domain  and  frequency-domain  descriptions  is  carried  out  by  the  Fourier  transform  pair, 
mathematically  defined  by, 

G(j)=]g(l)e-’'--'Jl  (I.l) 

-CO  -CO 


A  spread  spectrum  s\’stcm  is  one  in  which  the  transmitted  signal  is  spread  over  a  frequency  band 
that  is  much  wider  than  the  minimum  bandwidth  required  to  transmit  the  information  being  sent. 

An  example  of  a  spread  spectrum  system  is  wideband  FM,  where  the  RF  spectrum  produced  is  a 
fimetion  of  the  information  bandwidth  and  the  amount  of  modulation.  A  t>pe  of  spread  spectrum 
signaling  of  interest  insofar  as  this  report  is  concerned  is  Direct  Sequence  Spread  Spectnim 
(DSSS)  [2],  where  a  carrier  signal  is  modulated  by  a  digital  code  sequence  (which  in  most  cases  is 
a  pseudo  noise  sequence)  whose  signaling  rate  is  much  higher  than  the  information  signal  bit  rate. 
DSSS  systems  produce  a  low  level  power  dcnsit\’  spectrum  which  gives  the  signal  its  low 
probability  of  intercept  (LPl)  property.  Tlte  LPl  feature  of  spread  spectrum  systems  lend 
themselves  to  communications  and  data  transmission  with  message  privacy  and  signal  hiding.  In 
the  early  1950's  these  features  were  mainly  of  importance  to  military^  communications,  but 
nowadays,  many  commercial  applications  are  to  be  found  in.  for  example  cellular  communication 
systems,  position  location  devices,  and  multi-user,  multi-channel  data  communication  networks  [2]. 

1.2  Pseudo  Noise  Sequence  Generator 

PN  Sequence  generators  are  feedback  shift  register  s\'stems  with  a  specific  register  length, 
producing  a  digital  code  sequence  of  prescribed  length  and  period  [2].  The  code  is  made  up  of 
binary  bits  called  chips  w  hose  repetition  rate  is  a  fimetion  of  the  chip  rate, 

^  _  clock  rate  in  chips  per  second 

code  length  in  chips 


and  the  code  length  is  dependent  on  the  number  of  shift  register  stages  as  well  as  on  the  feedback 
connections.  Figure  1-1  shows  a  PN  sequence  generator  configuration  with  a  feedback  connection. 
Specific  feedback  connections  determine  the  code  produced  by  the  sequence  generator. 


Code 

output 


Figure  1-1;  A  PN  sequence  generator  configuration 


1.3  Delay  and  Multiply  Receivers 

Delay  and  Multiply  (D&M)  receivers  are  used  as  presence  detectors  of  a  signal  in  noise  and 
interference.  The  objective  of  the  detector  is  to  identity  the  presence  of  the  signal  without  seeking  to 
determine  the  actual  transmitted  message[5].  Known  signals  are  detectable  by  either  passing  the 
received  waveform  through  a  filter  matched  to  the  signal,  or  b\'  correlating  the  received  waveform 
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with  a  reference  that  is  proportional  to  the  signal.  \Vlien  the  signal  is  not  known,  matched 
filters/corrclators  are  not  particularly  useful  because  of  the  required  signal  knowledge.  Digitally 
modulated  signals  cannot  be  considered  known  unless  the  sequence  that  modulates  the  signal  is 
known.  BPSK/DSSS  signals  are  BPSK  wavefonns  [4]  with  an  imposed  pseudo  noise  (PN) 
spreading  sequence.  Tlicrcfore  matched  filters/  correlators  cannot  be  used  to  presence  detect 
BPSK/DSSS  signals  unless  the  PN  spreading  sequence  is  known.  Detection  by  using  a 
conventional  analog  spectrum  anaKzer  or  FFT  processing  is  difficult  because  BPSK  signals  are 
not  periodic  (due  to  the  random  nature  of  the  sequence)  and  therefore  have  continuous  Fourier 
spectra.  Furthermore,  DSSS  signals  have  a  pseudocontinuous  spectrum  with  spectral  lines 
separated  in  frequency  b\’  the  reciprocal  of  the  duration  of  the  PN  sequence.  However  when  certain 
nonlinear  operations  are  applied  to  BPSK  for  BPSK/DSSS  signals,  discrete  spectral  components 
arise  [3].  These  components  are  then  often  detectable  using  spectaim  analyzer/FFT  processing 
techniques. 

Figure  1-2  show's  a  D&M  receiver  Block  Diagram,  where  the  implementation  of  a  quadratic 
nonlinearity'  with  w'hich  the  receiver  generates  the  discrete  spectral  components  at  the  DSSS 
signal’s  chip  rate  frcquencN*  can  be  seen.  Optimization  of  the  signal  detectability  of  the  D&M 
receiver,  in  the  sense  of  maximizing  the  spectral  SNR  of  the  signal’s  chip  rate  frequency  spectral 
line  can  be  estimated  by  the  delay  setting  ‘"d”  of  the  receiver.  A  special  case  corresponding  to  d=0 
gives  the  quadratic  structure  of  the  BPSK/DSSS  signal.  Autocorrelation  fiinctions  show  that  a 
maximum  pow'er  density  at  the  D&M  output  is  obtained  at  the  chip  rate  frequency  for  a  delav 
setting  equal  to  half  of  the  chip  duration. 


Figure  1-2:  D&M  Receiver  Block  Diagram 


1.4  SPW 

SPW  is  a  UNIX  based  software  package  used  for  the  purpose  of  simulating  communication  and 
signal  processing  systems.  SPW  has  been  proved  to  be  a  pow'erful  tool  to  evaluate  system 
performances.  Tlie  block-onentated  characteristic  allow's  building  and  simulating  of  complicated 
systems  with  the  same  ease  as  would  be  necessary'  for  more  simple  systems.  Schematics  are  entered 
in  the  Block  Diagram  Editor  (BDE)  where  systems  are  build  up  by  a  hierarchy  of  blocks  from 
existing  libraries.  User  defined  blocks  can  be  saved  in  a  personal  library'  as  part  of  a  complete 
system  and  as  syinbols  representing  the  complete  system.  Tliese  blocks  and  symbols  again  can  be 
imported  into  the  hierarchx  of  blocks  of  some  other  system.  Once  a  block  is  added  to  a  schematic, 
modifications  within  that  block  are  only  effective  within  that  schematic,  which  gives  rise  to  the 
possibiliu-  of  building  up  identical  systems,  however  working  under  different  conditions.  After 
building  a  system,  simulations  can  be  carried  out.  w  hile  showing  the  signals  of  interest  in  a  sianal 
calculator  mode.  Displa\  of  signals  of  interest  following  these  simulations  can  be  done  by 
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utilization  of  a  built  in  calculator  or  a  more  complicated  process  such  as  an  FFT  to  yield  the 
particular  analysis  of  interest. 


2  Transmitter  and  Receiver  System 


Figure  A-1  shows  the  block  diagram  generated  by  the  BDE  of  the  system  that  is  used  for  spreading 
an  input  binary'  random  signal  with  PN  sequences  shaped  by  HBE  pulses  (see  section  2.2)  and 
signal  detection  by  a  complex  delas’  and  multiply  receiver.  Simulations  of  the  signal  spreading 
operation  with  PN  sequences  shaped  b\’  raised  cosine  pulses  are  shown  in  Figure  A-3.  For 
purposes  of  this  study,  the  signal  is  to  be  detected  b\'  a  D&M  receiver  and  a  post-processing 
threshold  level  detector.  A  binary  random  data  signal  with  a  specific  form  of  spread  spectrum 
modulation  is  used  as  the  digital  communication  signal  of  interest.  Tlie  spectrum  of  this  spectrally 
spread  random  data  signal  when  processed  b\'  a  D&M  receiver  tends  to  show  a  spectral  line  at  tlie 
chip  rate  frequency  making  it  therefore  somewhat  easy  to  detect.  Tlie  purpose  of  this  project  is  to 
evaluate  the  detectability  of  this  signal  under  various  operational  scenarios  involving  noise  and 
multipath  effects  on  the  signal  and  receiver  processing  modifications.  In  order  to  reduce  the 
detectabiliU'  of  this  signal,  its  spectrum  before  transmission  has  to  appear  featureless.  A  featureless 
signal  will  show  a  spectrum  that  to  the  observer  appears  like  unusable  noise  without  any  significant 
spectral  lines,  Tlie  flip-wave  signal  generator  is  a  device  by  which  after  spectrum  spreading,  a 
featureless  signal  as  described  below  is  produced. 

2.1  Flip-Wave  Generator 


The  basis  of  the  flip-w'ave  signal  involves  QPSK  (Quadrature  Phase  Shift  Keying)  modulation.  A 
conventional  QPSK  modulator  is  shown  in  Figure  2-1. 


rfs- :: 


it? 


Figure  2-1:  QPSK  modulator  block  diagram 

The  random  data  input  signal  is  mathematically  described  as 


^(0=  -»</<x 

it  =-00 


(2.1) 


where  the  data  {/I* !  is  a  ±1  valued  random  sequence  and  p(t)  is  a  deterministic  pulse. 

It  is  assumed  that 

E{Ai^  }  =  0  {i.e.j'.v.'  s  are  zero  mean)  (2,2) 
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(2.3) 

(2.4) 

(2.5) 

(2.6) 


where 

,/(/)=/, (//2)  (2,7) 

A  so-called  offset  QPSK  (OPQSK)  modulated  signal  results  w  hen  amplitude  modulation  of  sine 
and  cosine  carriers  is  imposed.  Tliat  is, 

^OQrSK  (0  ~  I  .fc^  ~ 

=  A  cos[2;t  fj  +  ^(/)]  (2.8) 

The  carrier  phase  now  is 

^(t)  =  tan"'  -2^-2  (2.9) 

Zjit) 

and  can  take  on  values  ±7t/4,  ±3tc/4  as  a  result  of  the  fact  that  Z|(t)  and  ZQ(t)  are  bipolar 
waveforms.  The  wavefonn  produced  by  the  flip-wave  generator  is  built  up  according  to  the 
following  algorithm;  w  hen  Ak=l  the  carrier  phase  in  the  interv’al  (k-t-l)Tb<t<(k-i-2)Tb  is  that  of  the 
carrier  in  the  inter\'al  kTh<t<(k+l)Th  incremented  by  n/l  radians,  and  conversely  when  Ak=-l  the 
carrier  is  incremented  by  -n/2  radians.  Tlie  BDE  generated  block  diagram  of  the  flip-w'ave 
generator  is  shown  in  Figure  A-2  of  the  Appendix.  Tlie  flip-wave  generator  used  to  produce  the 
LPI  signal  generates  a  waveform  b\'  defining  two  sequences  of  r.v.’s,  {Xk}  and  {Yk}  which  are 
related  to  { Ak}  bv  the  recursions 

Xk.,=-AkYk  '  Yk.,=AkXk  k=0.1,2....  (2.10) 


The  initial  values 

Xo=l 

Yo=0 

(2.11) 

define  the  starting  time  t=0. 

Now  X(t)  and  Y(t)  are  defined  by 

X{t)  =  f^X,p{t-kT,) 

k^O 

1 

II 

(2.12) 

In  Figure  A-4,  the  signal  simulation  of  the  one-input/tw'o-output  flip-w  ave  generator  and  the 
transformation  signals  specified  b}’  Equation  2.10  and  Equation  2,12  with  the  random  data  input 
are  shown.  Tliese  waveforms  take  on  the  values  0,  ±1  in  such  way  that  if  X(t)  equals  ±1,  then  Y(t) 


4-9 


must  be  zero  valued  and  vice  versa  Tliis  is  the  basis  of  the  so-called  flip-wave  signal  which  is  an 
example  of  differential  phase  modulation  in  an  OQPSK  format  where  at  any  one  signaling  interval, 
a  phase  change  of  ±7r/2  rad.  must  take  place.  DSSS  modulation  for  spreading  of  the  flip-wave 
signal  is  accomplished  by  generating  spreading  codes,  using  linear  feedback  shift  register  systems 
and  various  shaped  HBE  pulses.  Detailed  mathematical  descriptions  [1]  of  these  spreading  methods 
results  in  the  expression  of  the  spread  flip-wave  signal  processed  by  the  receiver,  to  be  given  by 

=  Y.a„h{r-mr)+jY,Y,,.,p'it-i2k  +  \)T)f^j3,h{l~i2l  +  [)T/2)  (2.13) 

*=0  m  =  k~0  •  /=0 

where  {exm}  and  {pi}  are  ±1  valued  M-sequences,  T’  is  the  chip  rate  of  the  spreading  codes,  and 
h(t)  is  one  of  the  various  HBE  pulses  imposed  on  the  PN  spreading  codes. 

2.2  High  Bandwidth  Efficiency  Pulses 


Tlie  various  HBE  pulses  considered  are  described  as  follows: 

1.  Rectangular  shaped  pulses 

Let  T  be  the  duration  of  one  pulse  period,  and  the  rectangular  pulse  be  described  by 
0  ()fhen\'ise  ,  _ _ 


where  the  time  shift  between  spreading  codes  is 
T 

given  by,  —  =  T^, .  (T*  is  the  chip  rate  as  indicated  o 

2  0  1/2T  T 

above.) 

Figure  2-2:  Rectangular  Pulse 

Tlte  HBE  pulses  shapes  to  follow  arc  nonnalizcd  in 

amplitude  to  that  of  the  rectangular  pulse  shape  in  order  to  obtain  equal  signal  energies. 
2.  Raised  Cosine  shaped  pulses 


/,(/): 


I  sin  -{2m/4T) 
1  0 


0<i  <T  =  2T^ 
otherwise 


1/2  T 


(2.14) 


(2.15) 


Sine  shaped  pulses 

The  implementation  of  the  sine  shaped  pulses  Figure  2-3:  Raised  Cosine  Pulse 
is  obtained  b\'  exciting  finite  impulse  response 

(FIR)  filters  where  the  filter  coefficients  are  set  to  correspond  to  the  numerical  amplitude 
values  of  the  pulse  function.  Limited  numbers  of  coefficients  in  each  filter  block  are  the  reason 
why  for  longer  duration  pulses  parallel  filters  with  appropriate  time  delavs  are  used  in  order  to 
accomplish  the  desired  implementation. 

A.  Sine  fiinction  shaped  pulses  of  duration  T=4Tc  realized  using  w  hat  is  labeled  as  Filter 

(^)^sin(2£(^-27^)^  0<,<r  (2,16) 

2.t(/-27:.)/47;- 

b)  Hanning  weightced  sine  function  shaped  pulses  of  duration  T=4Tc  realized  using  what  is 
labeled  as  Filter  AW. 
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(0  ”  ^^s•/.vc•  (0* 


^;r/^ 


K^T,  j 


0<i<T 


(2.17) 


0  1/2  T  T 

Figure  2-4;  Sine  fiinction  shaped  pulses  of  duration  T=4Tc 

c)  Sine  ftinetion  shaped  pulses  of  duration  T=8Tc  realized  using  what  is  labeled  as  Filter  B, 

sin(2;r(/-47;.)/47;)  o</<7’  (2-18) 


^sme  (0  ~ 


27r{t-4T^)l4T^ 

d)  Hcinning  \veighted  sine  fiinction  shcipcd  pulses  of  duration  T— 8Tc  rcnlized  using  what  is 
labeled  as  Filter  BW. 


^HWSINC  (0  ~  ^SISC  (0  * 


^  TT 


^87,  J 


0<t<T 


(2.19) 


-Filter  B 
-  Filter  BW 


Figure  2-5:  Sine  funetion  shaped  pulses  of  duration  T-8Tc 

e)  Sine  function  shaped  pulses  of  duration  T=16Tc  realized  using  what  is  labeled  as  Filter  C, 

sm(2j(/-8r,)/4r,)  ,2.20) 

2;r((-87-r)/47;. 

f)  Hanning  weighted  sine  ftinetion  shaped  pulses  of  duration  T=16Tc  realized  using  what  is 
labeled  as  Filter  CW. 


^^mrsiNC  (0  ~  ^s/,vc  (0  * 


Q<t  <T 


(2.21) 


4-11 


Filter  C 
Filter  CW 


Figure  2-6:  Sine  fiinction  shaped  pulses  of  duration  T=16Tc 


2.3  PN  Generator 

Three  parallel  m-sequence  PN  generators  are  used  to  produce  amplitude  modulation  (AM  )  on  the 
chips.  Tlie  number  of  AM  levels  depends  on  the  weight  assigned  to  each  parallel  generator;  hence 
the  possible  outcomes  of  the  amplitude  variable  with  its  corresponding  probabilities. 
Normalization  of  the  different  amplitudes  is  realized  by  equalizing  the  output  by  the  corresponding 
mean  squared  values,  where 

(2.22) 

n=0 


The  different  cases  are  summarized  in  Table  2- 


chip 

weight 

#  of  levels 

a„, 

probability 

E{aM 

m 

1.1,1 

4 

±3 

'A 

3 

±1 

V, 

4.1.1 

6 

±6 

% 

18 

±4 

'/2 

±2 

% 

4-2.1 

8 

±1 

% 

21 

±5 

% 

±3 

'/4 

±1 

'/4 

Table  2-1  PN  sequence  AM 


2.4  Di&M  receiver 

The  complex  D&M  receiver  can  operate  in  two  different  switch  selected  modes.  There  is  a  choice 
of  processing  the  baseband  signal  with  or  without  comple.x  conjugation  prior  to  multiplication.  If 
complex  conjugation  is  selected,  the  input  signal  is  multiplied  by  its  complex  conjugated  delayed 
version.  Tliis  is  shown  in  Figure  2-7. 
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Figure  2-7:  Block  diagram  of  the  Delay  and  Multiply  Receiver  with  switch  selected  CC 

3  Simulations 


3.1  Delay 


One  of  the  variables  that  affects  the  output  of  the  D&M  receiver  is  its  delay  setting.  By  changing 
the  delay  of  the  D&M  receiver,  the  spectmm  of  the  receiver  output  will  be  affected,  making  the 
delay  setting  an  important  variable  which  influences  the  signal  detection  probability  performance  of 
the  receiver.  For  different  t\pes  of  HBE  pulses  the  effect  of  the  delaN'  setting  on  the  performance 
of  the  D&M  receiver  as  well  as  the  effect  of  using  complex  conjugation  prior  to  multiplication  is 
analyzed. 


3.1.1  Raised  cosine  pulse 

For  simulation  purposes  a  raised  cosine  pulse  is  used  with  a  chip  rate  frequency  of  fc=256  kHz. 
Tlie  time  shift  between  spreading  codes  is  given  by. 


=  1 .953  1  //sec 


(3.1) 


With  a  simulation  sampling  frequency  set  to  fs=5.120  kHz.  one  chip  interval  represents  f,  /fc  =20 
samples.  Tlie  presence  of  spectral  lines  in  the  spcctaim  at  the  output  of  the  D&M  receiver  under 
different  delay  settings  for  raised  cosine  shaped  pulses  are  described  and  sununarized  in  Table  3-1 
and  Table  3-2.  Figure  A-5  in  the  Appendix  shows  some  of  the  spectra  obtained  in  the  SPW  signal 
calculator  mode  for  raised  cosine  shaped  pulses  with  and  without  complex  conjugation.  It  is 


observed  that. 


Raised  cosine  pulse  without  Complex  Conjugation 

Delay 

(sample  points) 

visible  spectral  lines 

Delay-0 

fc 

phase  of  spectmm  is  zero  for  4  f.  6  f.  8  f.  10  f 

Delays  1-2 

fc 

zero-phases  disappear 

Dela\-3-4 

fc.  5  fc.  7  fc 

Delay=5 

fc.  3  fc.  5  fc  9  fc 

Delay=6 

fc  3  fc  5  fc  7  fc 

Dela\=7-10 

fc,  3  fc  5  fc  7  fc.  9  fc 

Delay=ll-15 

3  fc  5  fc  7  fc 

Dela\'=  1 6 

5  fc  7  fc 

Delay- 17-20 

No  significant  spectral  lines 

Table  3-1  Spectral  lines  for  Raised  Cosine  shaped  pulses  versus  delay  without  CC 
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Using  delay  settings  of  less  than  or  equal  to  half  a  chip  duration  results  in  the  presence  of 
strong  spectral  lines  at  the  frcquenc>'  f;  in  the  output  of  the  D&M  receiver. 

Spectral  lines  at  frequencies  which  are  multiples  of  the  chip  rate  frequency  do  not  exceed  the 
spectral  power  level  around  the  chip  rate  frequcnc\'. 

Tlie  spectral  line  at  the  frcquenc\’  f^  at  the  output  of  the  D&M  receiver  without  Complex 
Conjugation  disappears  for  delays  greater  than  half  a  chip  duration  (10  sample  delay)  .  Also  in 
this  case  the  spectral  lines  at  frequencies  which  are  multiples  of  fc  do  not  exceed  the  spectral 
power  level  around  the  chip  rate  frequency. 

At  a  receiver  dela>'  setting  near  1  chip  duration  (20  samples)  in  length,  all  spectral  lines  at  the 
output  disappear. 


Raised  cosine  pulse 

with  Complex  Conjugation 

Delay 

(sample  points) 

visible  spectral  lines 

Dela\'=0-2 

OHz.  2  fc 

Delav^B 

OHz.  2  f..  6  fc 

Dela\’=4 

OHz.  2  fc.  4  fc.  6  fc 

Dela\=5-9 

OHz.  2  fc.  4  fc.  6  fc.  8  t; 

Delay  - 1 0 

OHz.  2  fc 

Delays  1 1 

OHz.  2  fc.  4  fc.  6  fc.  8  fc 

Dela\=12-13 

2  fc  4  fc  6  fc.  8  fc 

Dclay=I4 

4  fc  6  fc.  8  fc 

Delay=15-16 

4  fc  6  fc 

Delay=  17-20 

No  significant  spectral  lines 

Table  3-2  Spectral  lines  from  Raised  Cosine  versus  delay  with  CC 

•  Receivers  with  CC  produce  at  their  output  spectral  lines  at  even  valued  multiples  of  fc  versus 
odd  valued  multiples  of  fc  for  receivers  without  CC. 

•  A  ftirther  difference  between  a  receiv  er  using  Complex  Conjugation  and  a  receiver  that  does 
not,  is  that  the  fonner  produces  an  output  spectral  line  at  zero  frequency,  as  long  as  it  is  not 
suppressed  due  to  the  use  of  a  large  value  of  dela\'. 

Spectral  Strength  at  the  Chip  Rate  Frequency  with 
Variable  Receiver  Delay  for  Raised  Cosine  shaped  pulses 

OfTni — j — rrn — T~~r: — \~~~r 
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Figure  3-1:  Magnitude  of  chip  rate  frequency  spectral  for  lines  Raised  Cosine  pulses 

•  The  main  lobe  bandwidth  of  the  receiver  output  spectrum  is  approximately  8*fc. 

•  The  spectral  magnitude  of  the  chip  rate  frequenc\’  component  is  maximum  for  small  delays. 

•  A  spectral  strength  of  approximately  -50  dB  is  considered  to  be  the  level  at  which  the  spectral 
lines  are  not  detectable  \\  hen  compared  to  the  rest  of  the  spectaim. 

•  For  delays  greater  than  10  samples  (half  a  chip  duration  interval)  the  spectral  line  at  the  chip 
rate  frequency  cannot  be  discriminated  from  the  rest  of  the  spectrum. 

Figure  3-1  shows  the  changing  strength  of  the  spectral  component  at  fc  as  a  function  of  delay  for  a 
receiver  that  uses  or  bx^passes  complex  conjugation. 

3. 1. 2  Rectangular  shaped  pulse 

The  same  chip  rate  frequency  of  fc=256kH2  and  sampling  frequency  of  fs=5, 120ksps  are  used  as 
in  the  case  of  Raised  Cosine  pulses.  Again,  one  chip  duration  interval  represents  fs  /fc  =20 
samples.  Tlie  presence  of  spectral  lines  in  the  spectaim  at  the  output  of  the  D&M  receiver  under 
different  delav  settings  for  rectangular  shaped  pulses  are  described  and  summarized  in  Table  3-3 
and  Table  3-4.  Figure  A-6  in  the  Appendix  shows  some  of  the  spectra  obtained  in  the  SPW  signal 
caculator  mode  for  rectangular  shaped  pulses,  It  is  observed  that. 


Rectangular  shaped  pulse  without  Complex  Conjugation 

Delay 

(sample  points) 

visible  spectral  lines 

Delav-0 

No  significant  spectral  lines 

Dela\'=  1 

3  fc.  5  fc.  7  fc.  9  fc 

Dela\-2 

fc.  3  fc.  -3  fc.  7  fc.  9  fc 

Dela\-3 

fc.  3  fc.  5  fc.  9  fc 

Dela\'=4-10 

fc  3  fc  5  fc  7  fc  9  fc 

Delav=ll 

fc  3  fc  5  fc  7  fc 

Delav  - 1 2 

fc  3  fc  7  fc  9  fc 

Delav- 13 

fc  5  fc  7  fc.  9  fc 

Delay- 14- 15 

fc  3  fc  fc  7  fc  9  fc 

Delav- 16 

fc  3  fc  7  fc  9  fc 

Delav- 17 

fc  3  fc  5  fc  9  fc 

Delav— 18 

fc  3  fc.  .s  fc.  7  fc  9  fc 

Delav—  1 9 

3  fc  5  fc  7  fc  9  fc 

Delav— 20 

1  No  significant  spectral  lines 

Table  3-3  Spectral  lines  from  Rectangular  shaped  pulse  versus  delay  without  CC 

•  Spectral  lines  at  the  frequency  fc  are  strong  for  receiver  delays  set  between  approximately  4 
and  17  sample  points. 

•  Spectral  lines  at  frequencies  which  are  multiples  of  the  chip  rate  frequency  do  not  exceed  the 
spectral  power  level  around  the  chip  rate  frequency. 

At  a  receiver  delay  set  to  i  chip  duration  (20  samples)  all  spectral  lines  at  the  output 
disappear. 
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Rectangular  shaped  pulse  with  Complex  Conjugation 

Delay 

(sample  points) 

visible  spectral  lines 

Dclay=0 

OHz 

Dcla\'=l-4 

OHz.  2  T.  4  fc  6  f,.  8  fc 

Dela\-5 

OHz.  2  fc.  6  fc 

Dela\'=6-9 

OHz.  2  fc,  4  fc,  6  fc  8  fc 

Delay  - 1 0 

OHz 

Delay- 11-14 

OHz.  2  fc  4  fc  6  fc  8  fc 

Delay—  1 5 

OHz.  2  fc  6  fc 

Dela\-16-17 

OHz.  2  fc  4  fc  6  fc  8  fc 

Delay- 18- 19 

2  fc  4  fc  6  fc  8  fc 

Delay— 20 

No  significant  spectral  lines 

Tabic  3-4  Spectral  lines  from  Rectangular  shaped  pulse  versus  delay  with  CC 

•  The  receiver  with  CC  produces  at  its  output  spectral  lines  at  even  \alued  multiples  of  fc  versus 
odd  valued  multiples  of  fc  for  the  receiver  without  CC. 

•  The  spectral  line  at  zero  frequenc\'.  if  not  suppressed  due  to  high  receiver  delay  settings  is 
visible  at  the  output. 


Spectral  Strength  at  the  Chip  Rate  Frequency  with 
Variable  Receiver  Delay,  Rectangular  shaped  Pulses 


Spectral 

Magnitude 

(dB) 


□  Without  CC 
m  WithCC 


Figure  3-2:  Magnitude  of  chip  rate  frequency  spectral  lines  of  rectangular  shaped  pulses 

•  The  main  lobe  bandwidth  of  the  output  spectnim  varies  between  approximately  2*fcand  8*fc 
Hz  for  the  different  recci\er  dela>'  settings. 

•  A  spectral  strength  of  approximateh-  -65  dB  is  considered  to  be  the  level  at  which  the  spectral 
lines  are  not  detectable  when  compared  to  the  rest  of  the  spectmm. 

Figure  3-2  shows  the  changing  strength  of  the  spectral  component  at  fc  as  a  function  of  delay  for  a 
receiver  that  uses  or  b\passes  complex  conjugation. 

3.L3  Sine  pulse 

•  Analogous  to  the  above  described  simulations,  the  strength  of  the  chip  rate  frequency  spectral 
lines  are  presented  graphically  in  Figure  3-3  to  Figure  3-8  for  cases  in  which  the  pulse  shape 
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are  of  various  sine  fiinction  form  as  described  in  Section  2.2.  One  chip  duration  is  equal  to  20 
samples.  The  receiver  output  spectra  observ'cd  from  the  various  simulations  carried  out  are 
shown  in  Figures  A-7  to  A-9  in  the  Appendix.  The  focus  of  the  output  spectral  lines  for 
systems  with  CC  is  at  frequencies  that  are  twice  as  high  (312  kHz)  as  for  systems  without  CC 


(256  kHz). 


Spectral  Strength  at  the  Chip  Rate  Frequency  with  Variable 
Receiver  Delay  for  Sine  Pulse  shaped  by  Filt  A 


Spectral 

Magnitude 

(dB) 


□ 


[Without  CC 
With  CC 


Figure  3-3;  Magnitude  of  chip  rate  frequency  spectral  lines  for  Sine  pulse  shaped  by  Filter  A 

Simulation  results  for  D&M  rccci\'ers  processing  a  spread  flip-wave  signal  with  sine  shaped  HBE 
pulses  described  by  Equation  (2.16)  are  shown  in  Figure  A-7a  and  Figure  A-7b.  Referring  to  these 
simulation  results  the  following  observations  can  be  made. 

•  Tlie  main  lobe  bandwidth  of  the  output  spectrum  varies  between  approximately  5*fc  and  7*fc. 

•  The  spectral  line  at  the  chip  rate  frequency  for  both,  a  receiver  using  or  b>passing  complex 
conjugation  is  not  x  isible  anymore  for  delays  greater  than  14  samples,  where  20  samples 
represent  the  length  T. 

•  A  spectral  strength  of  approximatcK'  -50  dB  is  considered  to  be  the  level  at  which  the  spectral 
lines  are  not  detectable  when  compared  to  the  rest  of  the  spectaim. 

•  The  spectral  lines  at  the  output  of  the  system  with  CC  for  dcla\  s  smaller  than  the  mentioned  14 
samples  are  all  clearly  visible,  but  the  strength  may  not  always  exceed  the  set  -50  dB  minimum 
level  especially  for  the  case  of  smaller  delay's. 

Figure  3-3  shows  the  ch.anging  strength  of  the  spectral  component  at  fc  as  a  function  of  delay  for  a 
receiver  that  uses  or  b\passes  complex  conjugation. 
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Spectral  Strength  at  the  Chip  Rate  Frequency  with  Variable 

Receiver  Delay  for  Sine  Pulse  shaped  by  Filt  AW 
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Spectral 

Magnitude 

(dB) 
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□  Without  CC 
iJwith  CC 


1/2  chip 

D&M  Receiver  Delay 


1  chip 


Figure  3-4:  Magnitude  of  chip  rate  frequency  spectral  lines  for  Sine  pulse  shaped  by  Filter  AW 

Similar  observations  were  made  for  simulations  with  D&M  receivers  processing  a  spread  flip-wave 
signal  with  sine  shaped  HBE  pulses  described  by  Equation  (2.17).  These  results  are  shown  in 
Figure  A-7c  and  Figure  A-7d  as  was  done  when  HBE  pulses  from  Equation  (2.16)  were  used.  The 
only  difference  here  is  that  the  main  lobe  bandwidth  of  the  output  spectrum  varies  between 
approximately  8*fc  and  I0*fc.  Figure  3-4  show's  the  changing  strength  of  the  spectral  component  at 
fc  as  a  function  of  delay  for  a  receiver  that  uses  or  bypasses  complex  conjugation. 


Spectral  Strength  at  the  Chip  Rate  Frequency  with  Variable 


Spectral 

Magnitude 

(dB) 


Figure  3-5:  Magnitude  of  chip  rate  frequency  spectral  lines  for  Sine  pulse  shaped  by  Filter  B 

Figure  A-8a  and  Figure  A-8b  show  simulation  results  for  D&M  receivers  processing  a  spread  flip- 
wave  signal  with  sine  shaped  HBE  pulses  described  by  Equation  (2.18).  Observations  made  for 
simulations  of  D&M  receivers  processing  a  spread  flip-wave  signal  with  sine  shaped  HBE  pulses 
described  by  Equation  (2. 18)  are. 
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•  A  main  lobe  of  the  output  spectrum  is  not  clearl\'  visible. 

•  For  receiver  dela\  s  around  6  to  8  samples  (20  samples  are  ec|ual  to  1  chip  duration  interval) 
the  spectral  line  of  the  chip  rate  frequency  docs  not  exceed  the  spectral  strength  of  surrounding 
spectral  lobes. 

•  A  spectral  magnitude  between  -45dB  and  -50  dB  is  considered  to  be  the  level  at  which  the 
spectral  lines  are  not  detectable  w  hen  compared  to  the  rest  of  the  spectaim. 

•  The  spectral  lines  at  the  output  of  the  system  with  CC  for  delays  smaller  than  the  mentioned  14 
samples  are  all  clearly  visible,  but  the  strength  may  not  ahvaj  s  exceed  the  set  -50  dB  minimum 
level  especially  for  the  case  of  smaller  delays. 

Figure  3-5  shows  the  changing  strength  of  the  spectral  component  at  fc  as  a  function  of  delay  for  a 

receiver  that  uses  or  bspasses  complex  conjugation. 


Spectral  Strength  at  the  Chip  Rate  Frequency  with  Variable 


Spectral 

Magnitude 

(dB) 


Figure  3-6:  Magnitude  of  chip  rate  frequency  spectral  lines  for  Sine  pulse  shaped  by  Filter  BW 

Figure  A-8c  and  Figure  A-8d  support  the  following  observations.  Difference  between  D&M 
receivers  processing  a  spread  flip-wave  signal  with  sine  shaped  HBE  pulses  described  by  Equation 
(2.18)  and  receivers  processing  the  Hanning  weighted  sine  shaped  pulses  from  Equation  (2.19)  is 
only  observ'able  in  small  spectral  strength  differences  at  the  frequencies  of  interest.  Figure  3-6 
shows  the  changing  strength  of  the  spectral  component  at  fc  as  a  ftinction  of  dela>'  for  a  receiver 
that  uses  or  b\passcs  complex  conjugation. 
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Spectral  Strength  at  the  Chip  Rate  Frequency  with  Variable 


Spectral 

Magnitude 

(dB) 


Without  CC 
With  CC 


Figure  3-7:  Magnitude  of  chip  rate  frequency  spectral  lines  for  Sine  pulse  shaped  by  Filter  C 

Simulation  results  displayed  in  Figure  A-9a  and  Figure  A-9b  show  the  case  of  D&M  receivers 

processing  a  spread  flip-wave  signal  with  sine  shaped  HBE  pulses  described  by  Equation  (2.20). 

Tlie  following  observations  can  be  made. 

•  A  main  lobe  of  the  output  spectrum  is  not  clearly  visible. 

•  For  receiver  dela\s  around  6  to  8  samples  (20  samples  is  equal  to  T  in  time  duration)  the 
spectral  line  of  the  chip  rate  frcquenc\'  does  not  exceed  the  spectral  strength  of  surrounding 
spectral  lobes. 

•  Due  to  difTerent  strengths  of  the  spectral  lobes,  a  spectral  strength  beUveen  approximately  - 
45dB  and  -30  dB  is  considered  to  be  the  level  at  which  the  spectral  lines  are  not  detectable 
when  compared  to  the  rest  of  the  spectnim. 

•  Tlie  spectral  lines  at  the  output  of  the  s\  stem  with  CC  for  dela\'S  smaller  than  the  mentioned  14 
samples  are  all  clearly  visible,  but  the  strength  may  not  always  exceed  the  set  -50  dB  minimum 
level  especialh’  for  the  case  of  smaller  delays. 

Figure  3-7  shows  the  changing  strength  of  the  spectral  component  at  fc  as  a  fimction  of  delay  for  a 

receiver  that  uses  or  bypasses  complex  conjugation. 
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Spectral  Strength  at  the  Chip  Rate  Frequency  with  Variable 
Receiver  Delay  for  Sine  Pulse  shaped  by  Filt  CW 

-10 
-20 

Spectral  -30 
Magnitude  -40 
(dB)  -50 
-60 
-70 
-80 . 

0  1/2  chip  1  chip 

D&M  Receiver  Delay 


Figure  3-8;  Magnitude  of  chip  rate  frequency  spectral  lines  for  Sine  pulse  shaped  by  Filter  CW 
When  simulations  are  run  for  D&M  rccei\  ers  processing  a  spread  flip-wave  signal  with  sine 
shaped  HBE  pulses  described  by  Equation  (2.21),  similar  observations  can  be  made  as  for 
receivers  processing  sine  shaped  HBE  pulses  described  by  Equation  (2.20).  The  results  are  shown 
in  Figure  .•\-9c  and  Figure  A-9d.  Figure  3-8  shows  the  changing  strength  of  the  spectral  component 
at  fc  as  a  function  of  delay  for  a  receiver  that  uses  or  bvpasses  complex  conjugation. 

3.2  Noise 

In  practical  situations,  a  signal  is  transmitted  over  a  channel  w'here  the  signal  will  be  susceptible  to 
noise  interference.  Noise  in  the  channel  and  receiver  s\'stem  will  affect  the  performance  of  the 
D&M  receiver.  For  modeling  purposes,  the  noise  is  added  to  the  signal  of  interest  before  it  enters 
the  D&M  receiver.  The  noise-level  and  its  bandwidth  can  be  set  in  the  simulation  of  the  system. 
Normalization  of  the  AM  modulator  with  tliree  parallel  scaled  m-sequence  PN  generators  insures 
that  the  possible  AM  signal  levels  have  a  minimum  set  distance  from  each  other.  To  distinguish  the 
different  levels  of  the  AM  modulator  no  more  noise  is  added  to  the  signal  than  specified  by  the 
minimum  signal  to  noise  ratios  shown  in  Table  3-5  for  the  different  cases  of  chip  weight.  Although 
this  is  not  necessarily  the  case. in  practical  systems,  the  approach  used  insures  that  the  signal  is  not 
buried  in  the  noise.  (See  also  Table  2-1 .)  Therefore  the  added  system  noise  is  limited  such  that 
5AK  =  -201og[£{3:;J  (3.2) 


#  of  levels 

E{aM 

m 

SNR  (dB) 

4 

3 

9 

6 

18 

25 

8 

21 

26 

Table  3-5  SNR  for  different  PN  sequences 


Simulations  are  run  for  a  receiver  operating  SNR  with  values  specified  in  Table  3-5.  Systems 
without  an\'  noise  have  SNR=  co ,  The  selected  noise  bandwidth  is  as  wide  as  the  signal  bandwidth. 
Doubling  of  the  noise  bandwidth  results  in  a  noise  amplitude  spectral  strength  multiplied  by  a 

factor  -^2  in  the  simulations  with  SPW. 
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Random  white  noise  has  a  flat  frequency  spectaim,  where  all  frequencies  are  present  in  equal 
strength.  Tlierefore  such  noise  is  clearly  featureless.  Adding  noise  to  the  signal  results  in  the 
strength  (in  dB)  of  the  system  output  spectrum  to  be  raised  equally  over  the  entire  frequency  range. 
The  lower  the  SNR,  the  more  the  output  spectrum  characteristics  become  that  of  a  featureless 
signal.  However  the  spectral  line  component  at  the  chip  rate  frequencs'.  always  remains  visible  due 
to  the  limitations  placed  on  the  SNR.  Figure  A- 10  of  the  Appendix  shows  the  output  spectrum  of  a 
D&M  receiver  processing  a  signal  without  noise,  the  spectaim  of  pure  noise  and  the  output 
spectrum  of  a  D&M  receiver  processing  a  signal  with  a  minimum  SNR. 

3.3  Multipath  effects 

A  major  source  of  interference  in  communication  applications  is  channel  induced  multipath  which 
results  in  tlie  received  signal  to  be  made  up  of  the  sum  of  scaled  and  delayed  versions  of  the 
transmitted  signal.  The  next  set  of  observations  relate  to  D&M  receiver  simulation  results  with  one 
level  of  multipath.  That  is,  the  original  signal  together  with  one  scaled  and  delayed  replica  arrive  at 
the  receiver.  Figure  A-1 1  shows  some  D&M  receiver  output  spectra  after  processing  signals  with 
various  one  level  multipath  delays  and  magnitudes. 

•  When  the  scaled  and  delayed  component  due  to  multipath  has  a  delay  smaller  or  equal  to  half  a 
chip  duration  interx'al  and  a  magnitude  as  strong  as  the  transmitted  signal,  the  effect  of 
multipath  is  significant  on  the  output  spectaim  of  the  D&M  recen  er  as  the  spectral  lines  at  the 
chip  rate  frequency  and  multiples  of  it  disappear  in  these  cases.  In  Figure  A-1 1  the  D&M 
receiver  output  spectaim  of  a  signal  with  its  duplicate  due  to  multipath  is  shown  where  the 
delay  is  half  a  chipduration  and  the  duplicated  signals'  strength  is  varied. 

•  Systems  with  a  multipath  dela\'  equal  to  multiples  of  the  chip  duration  interval  (20  samples) 
show  spectral  lines  at  the  frequencies  of  interest  in  the  D&M  receiver  output  spectra  that 
remain  unchanged  for  all  multipath  strengths  that  are  weaker  (or  equally  as  strong)  as  the 
transmitted  signal. 

•  Values  of  the  multipath  delays  from  a  few  chip  duration  intervals  and  higher  do  not  cause  as 
much  spectral  line  weakening  as  for  cases  of  multipath  delays  with  low'er  values. 

•  Beyond  the  characteristic  spectral  lines,  the  rest  of  the  spectaim  for  high  values  of  multipath 
dela\'  is  more  flattened  and  the  main  lobe  of  the  spectaim  is  less  observable. 

•  Referring  to  the  different  spectral  strengths  at  the  chip  rate  frequency  for  receivers  that  do  or 
do  not  use  complex  conjugation,  it  is  iinpoaant  to  know  which  txpe  of  HBE  pulse  is  used  and 
to  which  value  the  dela\'  of  the  receiver  is  set  to.  Namely  in  the  case  w  here  HBE  rectangular 
shaped  pulses  are  used,  a  receiver  delay  of  half  a  chip  duration  (10  samples)  shows  a  D&M 
receiver  output  w  ith  spectral  lines  that  are  maximum  at  the  frequencies  of  interest  for  receivers 
that  do  not  use  complex  conjugation,  but  which  are  minimum  for  receivers  that  do  use  CC. 

•  If  the  spectrum  of  the  D&M  receiver  output  does  not  show  any  spectral  lines  when  no 
multipath  effects  are  present,  there  will  also  not  be  any  in  the  case  when  multipath  is  present. 

•  To  look  at  the  effect  of  multipath  on  D&M  receivers  with  complex  conjugation,  a  receiver 
delay  of  1 1  samples  was  chosen  which  resulted  in  close  to  maximum  spectral  lines. 

Since  multipath  is  a  phenomenon  for  which  the  signal  scaling  and  delay  parameters  are  not  known 
to  the  receiver,  random  chosen  values  of  scaling  and  delay  were  used,  in  order  to  simulate  a  more 
realistic  multipath  environment.  In  the  following  simulations,  multipath  is  realized  containing  five 
scaled  and  delayed  signal  replicas.  A  schematic  of  the  block  diagram  used  to  simulate  multipath 
with  more  than  one  dela\ed  and  scaled  signal  replica  is  shown  in  Figure  3-9. 
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Figure  3-9:  Block  diagram  of  multipath  with  five  delayed  and  scaled  signal  replicas 

For  such  simulations,  random  values  had  to  be  chosen  to  set  the  delay  and  scaling  factor  of  each 
signal  replica.  With  a  mathematical  computer  program  (MATLAB),  unifonuly  distributed  random 
numbers  for  the  delay  and  the  scaling  factor  were  generated.  Tlie  delay  of  each  of  the  five  signal 
components  is  a  random  variable  with  values  between  0  and  200  samples,  which  corresponds  to 
values  between  0  and  10  chip  duration  intervals.  Tlie  random  variable  for  the  scaling  factor 
involves  values  in  a  range  of  0  and  1 .5  times  the  original  signal  amplitude.  The  generated  values 
for  the  dela\-  and  scaling  factors  are  presented  in  Table  3-6.  To  compare  the  different  sets  of 
values,  the  multipath  systems  are  labeled  with  numbers.  For  example  the  multipath  system  with 
label  number  1 1  has  signal  duplicates  with  dela>  s  of  3,  63,  139,  23,  64  samples  and  scaling  factors 
of  respectively  of  0.5486,  0.1773,  0.X755,  0.6229,  0.9550  times  the  original  signal.  Some 
particular  spectra  of  D&M  receiver  outputs  processing  a  multipath  signal  can  be  found  in  Figure 
A-12. 


Table  3-6  Multipath  tables  with  their  random  chosen  dcla\’  and  scaling  factor  parameters 
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A  significant  difference  is  obsen  able  between  the  output  produced  by  receivers  with  CC  and 
receivers  without  CC. 

•  Receivers  not  using  complex  conjugation  show  spectral  lines  at  the  chip  rate  frequency  and 
frequencies  that  are  multiples  of  f  ^  in  any  of  the  above  defined  multipath  cases. 

•  Receivers  using  complex  conjugation  that  process  the  above  defined  multipath  signals  can 
show  spectra  without  any  spectral  lines  at  the  frequencies  ot  interest  for  the  cases  where  the 
multipath  cases  labeled  as  12,  16  and  19  are  used,  or  they  show  only  a  few  spectral  lines  that 
are  multiples  of  the  chip  rate  frequency  for  the  multipath  cases  labeled  as  1 1  and  18.  The 
output  spcctaim  of  receivers  w  ith  CC,  and  multipath  cases  labeled  as  13  and  14  show'  some 
weak  spectral  lines  while  for  multipath  cases  labeled  as  15  and  17  the  effect  of  multipath  is  not 
significant  and  all  spectral  lines  appear  at  the  output. 

3.4  Filtering  and  Integration 

Figure  3-9  shows  the  block  diagram  that  describes  the  system  used  to  detect  the  presence  of  the 
flip-W'ave  spread  spectrum  signal.  In  Figure  A-13,  simulation  results  are  showm  displaying  signals 
at  the  output  of  the  different  blocks  of  the  system  used  for  signal  presence  detection.  The  output  of 
the  D&M  receiver  processing  the  spread  flip-wave  signal  is  follow'ed  of  by  a  narrow'  band  filter,  a 
squarer.  an  integrator  and  a  threshold  detector.  The  frequencies  for  the  flip-wave  input  signal, 
random  data  switch,  and  chip  rate  are  respectively  2000Hz,  400Hz,  256,000Hz  or  in  a  5: 1:640 
ratio.  Tliat  means  that  5  data  bits  arc  contained  in  1  data  switch  interval. 


Figure  3-10:  Signal  detection  block  diagram  s\stem  design 


Table  3-7  shows  the  various  parameters  associated  w  ith  the  signal  presence  detection  problem, 
such  as  the  required  threshold  le\'els,  filter  bandwidth,  noise,  and  multipath.  Several  simulations 
were  used  to  determine  effective  parameter  settings,  such  as,  SNR  \  alues  of  the  additive  Gaussian 
noise  and  the  bandw  idth  of  the  narrow  band  filter.  The  effect  ot  channel  induced  multipath  is  also 
considered.  The  giv  en  threshold  levels  are  obtained  after  comparing  the  highs  and  lows  of  the  peaks 
of  the  integrator-detector  w  ith  the  known  logical  O's  and  1  's  random  sw  itch  states  for  each  of  the 
30  simulated  switch  state  intervals.  The  separation  between  the  highest  0  and  the  lowest  1 
detemiines  how  precisely  the  threshold  level  has  to  be  set.  A  logarithmic  separation  is  defined  by 
CT  =  Log[Lovvest  l]  -  Log[Hig  best  o]  (3.3) 

Approximate  \  alues  are  given  for  tlic  thresliold  level  used  to  obtain  simple  relationships  of  the 
parameter  settings  with  respect  to  the  needed  threshold  level. 
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Tabic  3-7  Threshold  level  extracted  from  logical  O's  and  I's 


Tlie  range  of  integrated  output  values  for  logical  O's  is  bigger  (on  a  logarithmic  scale)  than  that  for 
the  logical  1  's.  The  use  of  complex  conjugation  in  the  D&M  Receiver  does  not  setting  the  threshold 
level.  Only  the  spectrum  (of  the  D<S:M  output)  appears  different  when  referring  to  the  spectral  line 
at  0  Hz.  Two  diftcrcnt  multipath  cases  labeled  as  number  1 1  and  number  12  are  included  in  the 
threshold  level  simulations.  These  tw  o  multipath  cases  have  an  effect  on  the  performance  of  the 
D&M  receiver  that  is  different  in  each  case.  In  anaK  zing  the  performance  of  signal  presence 
detection  b\'  threshold  detection,  the  different  values  of  the  tw  o  multipath  s>  stem  parameters  do  not 
seem  to  affect  the  performance.  Tlie  values  set  for  the  threshold  level  to  detect  signal  presence  are 
shown  in  the  graph  of  Figure  3-11.  Figure  3-12  shows  the  separation  defined  by  Equation  3.3  for 
simulated  cases. 


4-25 


Threshold  level 


1  E+01 
1  E-02 
1  E4)5 
1  E-08 
1  E-11 
1  E-14 
Filter 


SNR  SNR 

No  MultiPath^^ 


Figure  3-1 1;  Used  Tlireshold  level  for  the  analyzed  systems 
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Figure  3-12:  Separation  available  for  selecting  threshold  level 

A  negative  separation  indicates  that  the  highest  logical  0  observed  after  the  integration  is  higher 
than  the  lowest  observed  logical  1 .  In  that  case  the  threshold  detector  can  not  be  set  to  a  level  that 
results  in  a  100%  error  free  signal  presence  detection.  Using  a  larger  filter  bandwidth  and  higher 
SNR,  in  general  enlarges  the  separation  and  reduces  the  likelihood  of  signal  presence  detection 
errors. 
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4  Conclusions  and  Recommendations 

The  features  of  a  delay  and  multiply  receiver  are  described  in  Section  1.3.  (More  details  are  found 
in  [1,  5]).  Tlie  performance  of  delay  and  multiply  receivers  processing  low  probability  of  intercept 
signals  (described  in  Section  2)  is  tested  under  different  conditions  in  order  to  study: 

1 .  the  effect  of  the  delay  setting  of  the  D&M  receiver  varying  from  0  to  1  chip  duration  interval. 
This  is  evaluated  for  signals  spread  by  various  shapes  of  HBE  pulses  imposed  on  PN  sequence 
generator  AM  signals. 

2.  the  effect  of  noise  added  to  the  input  signal  that  is  processed  by  the  D&M  receiver. 

3.  the  effect  of  multipath  on  the  performance  of  the  D&M  receiver  processing  an  input  signal  that 
is  degraded  by  channel  induced  multipath. 

Furthermore,  post  processing  of  the  D&M  receiver  output  by  a  combination  of  a  narrow  band 
filter,  squarer,  and  integrator  that  is  used  for  signal  presence  detection  is  investigated  from  a 
performance  standpoint. 

The  delay  of  the  receiver  is  evaluated  for  DSSS  signals  uith  raised  cosine,  rectangular,  and  several 
sine  shaped  HBE  pulses.  Tlie  variation  in  the  spectral  line  strength  at  die  chip  rate  frequency  for 
the  different  shaped  HBE  pulses  confirm  that  an  appropriate  delay  setting  in  the  D&M  receiver 
depends  partly  on  the  kind  of  HBE  pulse  used. 

The  use  of  complex  conjugation  affects  the  performance  of  the  receiver  definitely  under  the  various 
operational  scenarios  studied. 

Not  only  the  magnitude  of  the  spectral  lines  are  significant  when  it  comes  to  signal  detection,  but 
also  the  shape  of  the  total  spectrum,  and  in  particular  the  power  and  bandwidth  of  the  main  lobe 
are  affected.  White  Gaussian  noise  added  to  the  system  makes  the  simulations  more  realistic,  but 
does  not  significantly  affect  the  performance  of  the  receiver  as  long  as  specific  signal  to  noise 
ratios  are  maintained.  Interference  in  the  form  of  multipath  propagation  makes  the  performance  of 
the  delay  and  multiply  receiver  unpredictable.  However  the  long  observation  time  detector  in  the 
form  of  the  threshold  detector  using  post  D&M  receiver  processing  is  a  remarkably  reliable  signal 
presence  detector  if  the  threshold  level  is  appropriately  set.  The  performance  of  the  threshold 
detector  depends  strongly  on  the  filter  bandwidth  used  and  noise  level  of  the  system.  The  presence 
of  multipath  has  sometimes  a  stabilizing  effect  on  the  signal  presence  detection  performance. 
Tlierefore,  further  studies  into  automatic  threshold  level  settings  for  given  bandwidth  and  SNR 
values  are  necessary'  to  completely  specify  the  implementation  of  such  a  post  D&M  receiver 
processor. 
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Figure  A-5b;  SAP,  Raised  Cosine  D&M  output  with  CC  with  variable  delay 
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Figure  A-6a;  SAP.  Rctangular  shaped  pulses  D&M  output  without  CC  with  \  ariable  dela\' 


Figure  A-6b:  SAP.  Retangular  shaped  pulses  D&M  output  with  CC  with  \ariable  delay 
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Figure  A-7b:  SAP.  Sine  shaped  pulses  of  duration  T=4Tc  D&M  output  with  variable  delay 
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Figure  A‘7c;  SAP,  Sine  shaped  pulses  of  duration  T--4Tc  DitM  output  with  \’ariable  delay 


Figure  A-8a:  SAP.  Sine  shaped  pulses  of  duration  T=8Tc  D&M  output  with  variable  delay 


Figure  A-8b:  SAP,  Sine  shaped  pulses  of  duration  T=8To  D&M  output  with  variable  delay 
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Figure  A-8c:  SAP,  Sine  shaped  pulses  of  duration  T=STc  D&M  output  with  variable  delay 


Figure  .A-8d:  SAP,  Sme  shaped  pulses  of  duration  T=8Tc  D&M  output  with  variable  delay 
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Figure  A-9a:  SAP.  Sine  shaped  pulses  of  duration  T=16Tc  D&M  output  with  variable  delay 
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Figure  A-9c:  SAP,  Sine  shaped  pulses  of  duration  T=16Tc  D&M  output  with  \  ariable  delay 


Figure  A-9d:  SAP,  Sine  shaped  pulses  of  duration  T~16Tc  D&M  output  with  \ariablc  dclav 
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Figure  A- 10:  SAP,  Noise  simulation  Figure  A-1 1 :  SAP.  Multipath  with  one  delayed 


and  scaled  duplicate 


Figure  A- 12:  SAP,  Multipath  with  5  dcla\ed  and  scaled  duplicates 
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NON-DESTRUCTIVE  AND  OPTICAL  CHARACTERIZATION  OF  COMPOSITION 
AND  THICKNESS  IN  MULTILAYER  TERNARY  SEMICONDUCTOR  STACKS 


Xuesheng  Chen,  Assistant  Professor 
Department  of  Physics  and  Astronomy 
Wheaton  College 
East  Main  St.,  Norton,  MA  02766 

Abstract 

Multilayer  Ini.xGaxAs/InP  structures  have  found  wide  application  in  high-speed 
electronic  and  optical  devices.  The  composition  x,  the  layer  thickness,  and  their  uniformity  in 
the  structure  are  crucial  in  obtaining  desirable  device  performance.  In  this  report,  we  first 
present  the  spectoreflectance  technique  to  determine  the  composition  x  and  the  layer  thickness 
of  InxGai-xAs  in  a  multilayer  stack.  Then,  we  describe  the  photoluminescence  method  to 
determine  the  composition  and  its  uniformity  across  the  wafer,  and  to  describe  the  important 
role  photoluminescence  can  play  in  obtaining  an  accurate  refractive  index  function  n(x,X)  for 
Ini-xGaxAs. 
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NON-DESTRUCTIVE  AND  OPTICAL  CHARACTERIZATION  OF  COMPOSITION 
AND  THICKNESS  IN  MULTILAYER  TERNARY  SEMICONDUCTOR  STACKS 


Xuesheng  Chen 


Introduction 

Most  semiconductor  devices  are  optimized  by  heterojunctions,  which  are  commonly 
achieved  through  the  use  of  ternary  semiconductor  epitaxial  layers  such  as  lUxGai-xAs. 
Epitaxial  layers  are  the  building  blocks  in  optoelectronic  device  fabrication.  Due  to  its 
superior  electronic  properties,  the  ternary  semiconductor  InxGai.xAs  has  found  wide 
applications  in  high-speed  electronic  and  optical  devices  such  as  p-i-n  detectors,  avalanche 
photodiodes,  and  long  wavelength  diode  lasers.  The  ternary  composition  x  is  crucial  in 
obtaining  desirable  device  performance,  and  the  uniformity  of  each  epitaxial  layer  affects 
device  yields.  Routine  ternary  composition  and  thickness  measurements  by  TEM,  optical 
microscopy  and  double  crystal  X-ray  diffraction  (DCXRD)  are  usually  destructive,  labor 
intensive,  and  time-consuming  when  applied  to  evaluate  large  area  uniformity.  The  optical 
reflectance  mapping  method  developed  by  Weybume  and  his  collabrators  at  Air  Force 
Research  Laboratory  -  Hanscom  AFB  has  recently  been  shown  to  have  this  kind  of  desirable 
property  for  AlAs/GaAs  and  Ali.xGaxAs  /GaAs  multilayer  systems  [1,2].  I  joined  Dr. 
Weybume’s  group  during  the  summer  of  1996,  as  an  AFOSR  Summer  Faculty  Research 
Associate,  and  extended  this  method  to  Ini.xGaxAs/InP  multilayer  stacks  to  see  if  the 
composition,  thickness  and  their  uniformity  of  each  layer  can  be  determined  accurately.  It  is 
essential  with  this  method  to  have  a  model  to  describe  the  dependence  of  the  refractive  index 
n  on  the  composition  x  and  photon  wavelength  X  for  InxGai-xAs  because  the  optical 
reflectance  depends  on  n(x,X,)  and  layer  thickness  in  the  stack.  In  literature,  there  are  models 
for  n(x,X,),  but  only  for  lattice  matched  composition  x=0.53.  During  the  summer  1996,  I 
developed  a  model  for  n(x,?i)  by  modifying  Jensen’s  model  [3].  Comparing  with  modified 
Adachi’s  model  [4],  however,  I  found  that  the  deduced  composition  and  thickness  for  a 
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InxGai.xAs  layer  in  the  stack  from  optical  reflectance  are  model  dependent.  There  are  not 
enough  reliable  experimental  indices  available  in  the  literature  in  our  composition  and 
wavelength  range  for  us  to  determine  which  model  is  correct.  Here,  in  this  work,  we  (I  was 
collaborating  with  Dr.Weybume  and  Dr.  Paduano)  show  that  it  is  possible  to  determine  the 
refractive  indices  n  or  effective  n  for  the  epitaxial  layer  InxGai.xAs  on  InP  substrate  in  the 
composition  x  and  wavelength  X  range  we  are  interested  in,  and  to  obtain  a  reliable  formula  to 
describe  n(x,A,),  using  combination  of  spectroreflectance,  photoluminescence  and  X-ray 
diffraction.  In  this  report,  we  first  describe  the  optical  reflectance  technique  to  determine  the 
composition  and  thickness  of  InxGai-xAs  in  the  multilayer  stack.  Then,  we  describe  the 
photoluminescence  (PL)  method  to  determine  the  composition  and  its  unifoitnity  across  the 
wafer,  and  the  important  role  photoluminescence  can  play  in  determine  accurate  n(x,A,). 


Spectroreflectance 

A.  Theoretical  Reflectance 

The  reflectance  R  from  a  multilayer  stack  such  as  lUxGai-xAs  /InP  is  given  by  the 
equation  below  [5]: 


E 

Y 


n  m 

(nMj)M,{nMp 


j=i 


j=i 


where  the  matrices  Mj  are  given  by: 


(2) 
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(exp  i5j  +  exp-  idj ) /  (exp  idj  -  exp-  it 

/l 

rjj  (exp  i5j  -  exp-  iSj ) /  (exp  iSj  +  exp- 1 

/2 

The  meanings  of  those  symbols  in  eqs.(l)  to  (3)  are  the  following: 
j  denotes  the  j-th  layer, 
s  denotes  the  substrate, 

0  denotes  the  air, 

c  denotes  the  middle  cavity  layer  in  a  multilayer  stack, 

n  denotes  number  of  pairs  (e.g.  lUxGai.xAs  /InP )  above  the  cavity, 

m  denotes  number  of  pairs  below  the  cavity  and  above  the  substrate, 

5j  =  27tNjdjCOs6j/X,,  (k  is  wavelength,  and  dj  is  the  j-th  layer  thickness,  0j  is  the 
incident  angle  to  interface  of  the  j-th  and  G+l)-th  layers), 

Nj  =  n  -  ik,  (n  and  k  are  the  j-th  layer  refractive  index  and  extinction 
coefficients,  respectively), 

rij  =  Nj  cos  0j  for  TE  mode  or  Tjj  =  Nj  /cos  0j  for  TM  mode. 

As  you  can  see,  the  reflectance  R  depends  on  refractive  index  n(x,?i),  thickness  d  of  each 
layer,  and  the  wavelength  k.  If  n(x,X)  is  known  for  each  epitaxial  layer  in  the  multilayer  stack, 
its  composition  x  and  thickness  d  can  be  deduced  by  treating  them  as  adjustable  parameters  to 
get  the  best  fit  of  the  experimental  reflectance  curve  to  the  theoretical  reflectance  R  that  was 
described  by  eq.(l). 

Fig.  1  shows  the  simulated  reflectance  curve  with  three  different  compositions  by  using 
the  modified  Adachi’s  model  [2,4]  for  n(x,X).  The  curve  shifts  obviously  to  the  up  and  the 
right  with  small  increment  in  composition  and  is  very  sensitive  to  the  composition  change  for  a 
stack  similar  to  the  structure  shown  in  Fig.2.  It  was  shown  that  the  optical  reflectance  is  also 
highly  sensitive  to  the  thickness  of  each  epitaxial  layer  in  a  multilayer  AlAs/GaAs  stack  [1]. 
The  simulated  reflectance  curves  for  the  stack  with  three  different  GaAs  layer  thickness  are 
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shown  in  Fig.3.  The  dashed  lines  indicate  the  effects  of  increasing  the  GaAs  layer  thickness  by 
+0.5,  +1.0,  and  +2.0%,  respectively. 

B.  Results  and  Discussions 

The  work  was  done  on  multilayer  InxGai-xAs/InP  stacks  that  were  grown  by  MOCVD  at 
Hanscom  AFB.  Undoped  InP  and  InxGai-xAs  layers  were  grown  using 
trimethylaluminum  (TMI),  triethylgallium  (TEG),  phosphine,  and  arsine  as  sources.  Single 
side  polished,  (lOO)-oriented,  serai  insulating  InP  substrates  were  used.  The  test  structures 
consist  of  7  or  9  pairs  of  InxGai.xAs/InP  and  a  half  wavelength  cavity  layer  of  InxGai.xAs  (see 
Fig.2).  The  typical  thickness  for  the  pair  is  130  nm  and  160  nm  for  InxGai-xAs  and  InP, 
respectively,  and  the  cavity  has  a  thickness  of  260  nm.  To  avoid  light  adsorption  by  the 
layers,  we  choose  the  wavelength  range  of  1600  nm  to  2200  nm  which  is  mostly  in  the 
transparent  region  just  above  the  LixGai.xAs  bandgap  wavelength.  The  measurement  range  is 
kept  near  the  InxGai-xAs  bandgap  so  that  the  index  of  reflection  will  have  significant 
wavelength  dependence.  This  insures  that  the  fitted  composition  will  be  unique.  Choosing  a 
center  wavelength  around  1900  nm  means  that  the  quarter- wave  InP  stack  thickness  and  the 
InxGai-xAs  stack  thickness  can  be  calculated  by 

=  ,  (4) 

where  d  is  the  layer  thickness,  ^.center  is  the  center  wavelength,  and  n  is  the  appropriate 
index  of  refraction  evaluated  at  Xcenter-  The  half-wave  InGaAs  cavity  layer  on  top  of  the  stack 
is  grown  to  be  twice  the  InGaAs  stack  thickness. 

The  index  of  refraction  of  lattice-matched  InGaAs  was  calculated  according  to  Adachi 
[4].  The  parameters  were  adjusted  to  account  for  small  variation  in  the  composition  around 
this  value  by  fitting  the  experiment  data  to  Adachi’s  model.  The  index  of  InP  is  obtained  by 
fitting  data  from  Palik  [7]  (1000-2200  nm)  to  a  semi-empirical  Sellmerier  type  equation  as 
shown  below: 
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where  A,  is  in  nm.  Then  the  experimental  reflectance  curve  was  fitted  to  eq.(l)  to  deduce  the 
composition  and  thickness.  The  experimental  and  fitted  reflectance  spectra  for  7-pair  and  9- 
pair  InxGai-xAs/InP  structures  are  shown  in  Figs.4  and  5.  The  minimum  at  the  center  of  the 
high  reflectance  zone  corresponds  to  the  cavity  resonance.  This  minimum  location  is  a  good 
approximation  of  Acemer  and  provides  a  convenient  method  of  estimating  initial  values  for  the 
spectrum  fitting.  The  deduced  composition  x  and  layer  thickness  are  shown  in  Figs.  6  and  7 
for  a  7-pair  stack.  In  terms  of  composition  variation  with  respect  to  the  lattice-matched 
composition,  a  variation  range  of  -0.2%  to  +0.7%  is  observed  from  the  center  to  the  edge  of 
the  wafer.  For  85%  of  the  total  area  at  the  center  of  wafer,  the  thickness  variation  is  <  0.7%. 
Excluding  4  mm  from  the  edge  of  the  wafer,  the  uniformity  is  within  ±0.5%.  Therefore,  this 
InxGai-xAs  layer  is  lattice  matched  to  InP  substrate  over  90%  of  the  wafer  area. 

Spectroreflectance  method  described  here  is  very  sensitive  to  the  composition  and 
thickness  as  long  as  a  structure  similar  to  Fig.2  is  used.  The  accuracy  of  the  thickness,  and 
especially  the  composition  (x),  however,  depends  on  the  model  for  n(x,A).  See  Appendix  for 
details. 


Photoluminescence  Method 

It  was  shown  that  the  wavelength  position  at  the  half  of  photoluminescence  (PL)  peak 
intensity  shifts  with  the  composition  x  for  Ini.xGaxAs[7].  It  would  be  easy  and  non-destructive 
if  we  can  use  this  wavelength  position  to  determine  the  composition  x  and  its  uniformity  of  the 
Ini-xGaxAs/InP  stack.  In  order  to  find  out  how  exactly  the  half-PL-peak  intensity  wavelength 
position  changes  with  composition  x,  PL  spectra  from  more  than  a  dozen  of  single-layer  Inj. 
xGaxAs  layer  on  InP  substrate  were  measured.  The  composition  x  for  each  of  these  samples 
was  determined  by  double  crystal  X-ray  diffraction  (see  Fig.8).  Fig.9  shows  PL  spectra  from 
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3  samples  with  different  x  (xl=54.5%,dash;  x2=52.2%,solid;  x3=49.4%,dots).  The  notch  in 
the  center  of  the  PL  spectra  is  due  to  a  notch  in  the  grating.  Samples  are  excited  by  a  diode 
laser  at  674nm  with  lOmW.  As  you  can  see,  the  wavelength  position  at  half  PL-peak  intensity 
on  the  lower  energy  side  changes  a  lot  with  composition  x.  A  curve  describing  the  dependence 
of  this  wavelength  position  on  composition  x  is  obtained  from  the  PL  data,  shown  in  Fig.  10. 

Double  crystal  X-ray  diffraction  (DCXD)  can  not  easily  determine  composition  x  in  a 
Ini-xGaxAs/InP  stack.  It  is,  however,  very  convenient  to  measure  PL  and  use  Fig.  10  to 
determine  the  x.  To  see  how  accurate  with  this  method,  PL  was  measured  for  a  9-pair  Ini. 
xGaxAs/InP  stack  and  x  was  then  determined  from  Fig.  10  to  be  52.6%,  comparing  with  52.8% 
determined  by  DCXD.  It  is  hard  to  determine  which  number  is  more  accurate  because 
DCXD  cannot  provide  composition  directly  for  a  multilayer  stack  (assumptions  of  each  layer 
thickness  have  to  be  made).  The  composition  uniformity  map  for  a  9-pair  Ini.xGa.xAs/InP 
stack  using  PL  method  is  shown  in  Fig.  1 1 .  The  PL  peak  intensity  map  is  shown  in  Fig.  12. 

It  seems  that  the  composition  can  be  determined  easily  and  accurately  using  PL 
method  if  Fig.  10  is  accurate.  Accuracy  of  Fig  10  depends  on  if  the  compositions  of  single¬ 
layered  Ini.xGaxAs  on  InP  substrate  can  be  accurately  determined  by  X-ray.  Also  PL  and  X- 
ray  have  to  be  done  on  the  same  spot  of  the  sample.  PL  should  be  obtained  by  exciting  the 
sample  with  a  low  power  light  source  because  the  wavelength  position  at  half-PL-peak 
intensity  changes  with  the  excitation  power  that  is  above  certain  limit.  Fig.  13  shows  that  PL 
shapes,  positions,  and  width  hardly  change  when  excited  by  a  diode  laser  with  powers  below 
16.5mW.  Fig.  14,  however,  shows  that  they  do  change  when  excited  by  a  Argon  laser  with 
powers  that  range  from  4  to  250mW.  To  be  accurate,  same  type  of  low-power  excitation 
should  be  consistently  used  for  the  establish  of  Fig.  10  and  for  PL  measurements  on  to-be- 
examined  stacks 

The  PL  method  does  not  seem  to  be  able  to  determine  easily  the  layer  thickness  in  the 
stack.  The  spectroreflectance  is,  however,  very  sensitive  to  the  thickness,  as  shown  in  Fig.3. 
The  spectroreflectance  technique  relies  on  an  accurate  formula  of  n(x,>,),  which  can  be 
obtained  from  reflectivity  and  PL.  PL  can  be  used  to  determine  composition  x  of  In^xGaxAs, 
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and  the  reflectance  can  be  used  to  determine  a  formula  of  n(A,).  Using  samples  with  different 
X,  we  should  be  able  to  develop  an  accurate  refractive  index  function  n(x,A,). 
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Appendix 

It  is  essential  to  have  a  theoretical  model  to  describe  the  dependence  of  the  refractive 
index  non  the  composition  x  and  the  wavelength  X  for  InxGai-xAs  in  order  to  use  eq.(l).  We 
developed  a  model  for  n(x,?i)  of  lUxGai-xAs  by  modifying  B.  Jensen’s  model  [3].  It  uses  a 
quantum  mechanics  calculations  of  the  dielectric  constant  of  a  compound  semiconductor  and 
assumes  the  band  structure  of  Kane  Theory.  The  theoretical  expressions  for  n(x,A,)  is  given  in 
terms  of  the  basic  material  parameters  of  band  gap  energy  Eg,  effective  electron  mass  mn, 
effective  hole  mass  m  p,  spin  orbit  splitting  energy  A,  and  lattice  constant  a.  In  the 
nonabsorbing  range,  the  n(x,X.)  of  InxGai.xAs  can  be  described  by 

n^  =  1  +  2Co  {  (Yb-Yf)  -  z(  tan'^CYB  /z)  -  tan'^Yp/z)  }  ,  (6) 

where 

Yb  =  mo(a-ao) , 

mo  =  2.93  A , 
a  =(l-x)aGaAs  +  xai„As, 
acaAs  =  5.6534  A,  a^As  =  6.0585  A , 
z  =  [l-(froyEg)]^  , 
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(0=  2  k  (c/X.),  ( c  =  speed  of  light ) 

Eg  =  1.43  -  1.53x  +  0.45  , 

Co  =  ( cOvVco^^), 

ct^  =  Eg/ll , 

tOv  =  4Ke^Nv*/mn, 

Nv*  =  Nv(mymn) 

Nv  =  8/3kX\ 

^=-fi7(Egmn/2)'^ 

l/irir  =  l/nin  +  1/mp , 

mn=  0.07(1 -x)me  +  0.028xmc , 
nip  =  0.5(l-x)mp  +  0.33xmp , 
reie  =  9.10939  x  10'^*  gram, 

YF  =  2(ru/Nv)‘^ 

He  =  6.5  X  10*®  cm'^  ,  (carrier  concentration) . 

All  the  formulas  above  for  n(x,^)  are  in  cgs  units. 

Plugging  eq.(6)  into  eq.(l),  the  theoretical  reflectance  R(x,A.)  for  a  multilayer  InxGai 
xAs/InP  stack  can  be  obtained. 

The  composition  x  and  the  thickness  of  the  InxGauxAs  and  InP  in  the  7  pairs  and  the 
thickness  of  the  InxGai-xAs  cavity  can  be  easily  found  by  treating  them  as  adjustable 
parameters  to  get  the  best  fit  of  the  experimental  reflectivity  curve  to  the  theoretical  curve 
described  by  eq.(l).  For  a  sample  of  a  7-pair  InxGai.xAs(~130nm)/InP(~160nm)  with  a  half 
lUxGai-xAs  cavity  on  top,  we  obtained  from  the  fitting: 

thickness  dlnGaAsinpaire~1378A,  thickness  djnP  in  pairs” 1547 A, 
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thickness  dinGaAscavity=2553A,  composition  x  inxCai-xAs  =  0.555. 

These  values  are  close  to  the  targeted  growth  values. 

However,  when  the  same  experimental  curve  was  fitted  to  the  Adachi’s  model  [6],  we 
obtained: 

thickness  dluGaAsinpairs^l^l^-A,  thlCkneSS  dinP  in  pairs— 1502 A, 

thickness  dinGaAscavity=2557A,  composition  x  inxGai-xAs  =  0.578. 

The  two  sets  of  the  deduced  values  are  different,  especially  in  the  composition,  due  to 
the  two  different  models  for  the  n(x,A,)  of  InxGai-xAs. 
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Fig.l  Simulated  reflectivity  curves  for  a  IrixOai.^As/InP  multilayer  stack  -with  three 
different  In  compositions  using  the  modified  Adachi’s  model  [2,4]  for  the  refractive 
index  n(x,A,). 
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Experimental  (solid)  and  theoretical  (dash)  reflectance  spectra  from 
a  9-pair  InxGai-xAs/InP  stack. 
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Fig. 9.  Photoluminescence  spectra  from  3  samples  of  In^Ga.,  ^As  on  InP  substrate 
with  different  indium  composition:  x1=54.5%,  x2=52.2%,  and  x3=49.4%. 
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Fig.  11.  Composition  (x)  map  using  PL  method 
for  a  9-pair  InxGai-xAs/InP  stack. 
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Fig.  13.  Photoluminescence  spectra  at  different  diode  laser  powers 
from  a  single-layer  In^Ga^  ^As  on  InP  substrate. 
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ABSTRACT 


We  report  the  to  development  and  evaluation  of  a  thin  film  anti-reflection  coating  for  use 
on  gallium  arsenide  that  enhances  the  coupling  of  laser  energy,  at  specific  frequencies, 
into  that  material.  The  layers  is  of  the  single  index,  quarter  wave  type,  matched  to  specific 
semiconductor  specimens  and  to  laser  wavelengths  used  for  optical  excitation  of  various 
semiconductors.  The  purpose  of  this  development  is  to  increase  the  laser  power  coupling 
into  the  semiconductor  specimens  and  thereby  increase  the  radiated  E-M  field  strengths  of 
such  elements  when  used  for  flyable,  reconfigurable,  |i-wave  source/antenna  arrays.  This 
investigation  compliments  the  ongoing  project  at  AFOSR  Sensors  Technology  Branch  to 
develop  semiconductor,  E-M  source/antenna  elements  for  two  and  three  dimensional 
radar  arrays  for  airborne  systems  based  on  the  concept. 

INTRODUCTION 

Optically  excited  semiconductor  photo  carriers,  accelerated  in  a  dc  field,  was 
suggested  some  years  ago  [1]  as  a  potential  source  for  wide  band  microwave  pulses  in  the 
pico-second  width  range.  Such  sources  would  have  a  time  domain  width  controlled 
(approximately)  by  the  duration  of  the  optical  pulse  and  by  the  semiconductor  photo 
carrier  lifetime.  Optical  excitation  permits  multiple  E-M  source  generation  via  splitting 
of  a  single  laser  beam.  Phase  and  impedance  match  problems  inherent  in  UHF 
microwave  source  -  to  -  antenna  coupling  can  be  significantly  reduced  with  this  scheme. 
In  addition,  the  E-M  sources,  as  described,  could  act  as  their  own  radiative  elements 
(antennae)  thereby  producing  a  compact  array  which  can  be  steered  electro-optically  [2,3] 
rather  than  by  complex  mechanical  and/or  electronic  delay  lines.  This  would  be  a  distinct 
advantage  in  high  “G”  environments.  Cooling  to  at  least  LN2  temperature  is  also  feasible 
for  airborne  applications  and  has  the  potential  for  improving  the  field  strength  of  such 
arrays  by  increasing  the  mobility  and  hence  the  final  velocity  attained  by  the  optically 
induced,  semiconductor  carriers  [6]. 

Research  into  various  aspects  of  the  laser  induced,  pulsed,  picosecond,  E-M 
sources  (LIPPES)  has  proceeded  continuously,  albeit  at  a  low  level,  since  1994.  The 
initial  proof  of  concept  was  demonstrated  by  several  organizations  including  Rome 
Laboratory,  Hanscom  MA  [4].  The  concept,  simply  stated,  is  that  photo  carriers,  induced 
in  a  semiconductor  by  an  optical  laser  pulse  will  accelerate  in  the  presence  of  a  dc. 
electric  field  established  along  the  surface  of  the  semiconductor  (say  between  two  surface 
metallic  contacts  on  a  thin  polished  wafer).  Such  accelerating  carriers  (generally 
electrons)  will  radiate  electromagnetic  fields  in  proportion  to  the  applied  dc.  field  strength 
up  to  some  maximum  velocity  controlled  by  the  intrinsic  semiconductor  parameters  vis- 
a-vis  the  mobility.  Experimental  results  of  the  past  two  years  have  generally  confirmed 
this  hypothesis.  The  general  dc.  field  dependence  has  be  examined  and  reported  recently 
by  the  Liu,  et  al.  at  USAF  Rome  Lab.  Hanscom,  MA  [5].  In  that  study,  E-M  radiation 
field  strength  GaAs  and  InP  were  examined  as  a  function  of  applied  dc.  bias  and  it  was 
demonstrated  that,  for  both  materials,  a  plateau  in  the  radiated  field  was  reached  for  the 
dc.  field  above  some  threshold  -  about  5.5  kV  for  GaAs  and  12  kV  for  InP.  Based  those 
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studies  and  other  information  develop  thus  far,  InGaAs  should  to  be  an  even  better  source 
of  E-M  radiation  producing  stronger  E-M  fields  at  lower  dc.  voltages. 

BACKGROUND 

Recently,  we  completed  a  proof  of  principal  program  to  evaluate  the  next  level  of 
complexity  for  the  concept.  As  part  of  that  evaluation  [7],  a  single  source  was  excited  in 
two  spatially  separated  regions  by  splitting  the  exciting  laser  beam.  In  an  alternative 
configuration,  two  specimens,  stacked  in  physically  in  series  and  biased  separately,  were 
excited  simultaneously  by  the  (split)  laser  beam.  Near  and  far  field  measurements 
confirmed  that  a  phase  relationship  could  be  superimposed  on  the  detector  by  the  opto¬ 
mechanical  relationship  between  the  source(s)  and/or  the  exciting  laser  beam(s).  That  is, 
by  varying  the  physical  position  on  the  surface  and/or  arrival  time  of  the  optical  pulses, 
the  forward  direction  of  the  maximum  E-M  position  could  be  controlled. 

The  results  of  that  study  have  suggested  several  courses  that  might  be  followed  in 
order  to  increase  the  E-M  signal  strength  Eventually,  the  effort  will  devoted  to 
optimizing  the  choice  of  semiconductor,  the  physical  layout  of  the  contacts,  the  profile  for 
the  dc.  accelerating  voltages  and  the  spatial  relationships  between  the  semiconductor 
sources  and  the  light  excitation  beam.  One  parameter  that  is  of  utmost  importance  to  all 
the  measurements,  now  and  in  the  future,  is  the  optical  coupling  between  the  light  source 
and  the  semiconductor.  This  parameter  presents  challenges  even  now  at  this  early  stage 
of  evaluation  because  of  the  nearly  35%  loses  at  the  air/semiconductor  interface  and  the 
50:50  power  division  between  the  two  light  beams  from  a  single  source.  Therefore,  for 
this  study,  we  concentrated  on  the  development  of  an  anti-reflection  coatings  for  one  the 
two  semiconductors  that  are  presently  the  focus  of  the  LIPPES  studies. 

GaAs  REAR 


FIGURE  1:  LIPPES  series  configuration  for  a  two  ‘color’  laser  source 
exciting  two  E-M  radiating  element  in  series  configuration. 

For  the  experiments  performed  to  date,  the  light  source  has  been  a  Q  switched, 
mode  locked,  frequency  doubled,  YLF  laser  with  (ML/QS)  power  of  1.35  Watts  and  pulse 
energy  of  approximately  50|iJ  for  50ps  duration  pulse.  The  wavelengths  available  from 
the  YLF  laser  are  1053nm  and  (frequency  doubled)  526.5nm.  Assuming  laser  light,  at 
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normal  incidence,  impinging  from  an  air  medium  onto  a  non-absorbing,  partially 
reflective,  polished  surface  then  the  reflectivity,  for  that  single  interface  condition,  is 
given  by  the  Fresnel  reflection  equation: 


R  = 


n,  +n^ 


(1 


Where  ns  is  the  index  of  refraction  for  the  ambient  medium  (air,  nA=  1)  and  ns  in 
the  index  of  the  solid  (semiconductor).  Taking  GaAs  (the  most  used  semiconductor  of 
these  studies  thus  far)  as  an  example,  then  ns  «  4.025  and  the  reflection  will  be  36% ! 
For  InP,  the  other  semiconductor  use  to  date,  the  reflective  loses  are  somewhat  less  at 
31%.  Finally,  for  InSb  or  In  As  ,  which  appear  to  be  an  even  better  E-M  sources  based  on 
their  mobility,  reflection  lose  could  be  as  high  as  41%.  It  is  obvious  even  from  these 
simple  estimations  that  dividing  the  laser  source  between  many  targets  will  significantly 
affect  the  semiconductor  excitation  level  and  hence  the  resulting  E-M  field  strength 
which  is  the  fundamental  gauge  for  most  of  the  evaluations  thus  far.'  Therefore,  even  the 
basic  development  efforts  underway  now  would  profit  by  increasing  the  light  coupled  into 
the  semiconductors  and  improving  the  precision  of  the  research. 

DISCUSSION  OF  RESEARCH  PROBLEM 
Anti-reflection  coatings  can  range  from  a  simple,  single  layer  having  virtually 
zero  reflection  at  a  particular  wavelength  to  a  variable  index  or  multi  layer  system  having 
minimal  reflectance  over  a  range  of  several  octaves.  The  particular  choice  depends  on  a 
number  of  intrinsic  parameters  of  the  physical  system,  in  particular,  as  already  mentioned, 
the  indices  of  refraction  for  both  the  coating  and  the  substrate  (assuming  an  air  or  vacuum 
ambient).  These  parameters  are  not  as  simple  as  often  assumes  and  are,  in  fact,  complex 
numbers  the  real  and  imaginary  components  of  which  are  wavelength  dependent.  Thus 
the  true  index  would  be  given  by: 


T)  =  n  -  ik  2) 

where  n  is  the  (traditional)  real  index  of  refraction  and  k  is  the  extinction 
coefficient.  While  k  can  usually  be  made  quite  small  for  most  dielectric,  it  is  in  general 
finite  and  of  the  same  order  as  n  in  semiconductors.  It  can,  therefore,  not  be  ignored,  n 
and  k  are  related  to  the  particular  solid  through  the  high  frequency  dielectric  constant 
which  itself  is  a  function  of  the  light  wavelength. 

-  k^  =  e  (A. )  (3 

The  optical  constants,  n  and  k,  are  both  strongly  varying  functions  of  the  light 
frequency,  v  =  1/X,,  particularly  at  wavelengths  near  the  absorption  edge  of  the 


The  numbers  calculated  are  based  on  published  information  for  the  indicated  semiconductors  [7,8].  and 
assume  a  visible  green  wave  length  of  526.5nm  for  the  YLF  laser  source  after  frequency  doubling.  Some  of 
the  variations  suggested  for  improving  the  E-M  field  strength  will  require  AR  coatings  optimized  for  %  « 
1053nm  and  other  wavelengths  near  lOOOnm. 
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semiconductor.  Other  effects  such  as  impurity  levels  crystallographic  imperfections  and 
surface  preparation  (damage)  play  significant  roles  in  the  effective  optical  constants  of  a 
particular  specimen.  Finally,  the  polarization  of  the  light  itself  is  an  important  factor  in 
determining  R.  Generally  the  LIPPES  experiments  maintain  ‘p’  polarization  to  minimize 
the  reflection  at  the  beam  spliters,  dielectric  beam  steering  mirrors  and  semiconductor 
surface.  For  reasons  related  to  the  E-M  radiation  pattern  in  the  LIPPES  experiments, 
normal  incidence  is  preferred.  In  that  case  the  actual  reflectivity  equation  becomes: 


i^s  +ks^ 


(4 


Since  ns  and  ks  can  be  analytically  derived  from  the  parameters  measured  by  research 
ellipsometers,  those  values  will  be  the  starting  point  for  the  design  of  the  AR  coatings. 

Once  the  optical  constants  of  the  particular  semiconductor  wafer  surface  are 
known  the  task  of  determining  the  AR  coating  depends  primarily  on  the  wavelength  range 
for  which  reflection  must  be  a  minimized.  In  the  case  of  the  very  narrow  bands 
represented  by  laser  sources,  single  AR  layers  of  single  index  can  usually  be  defined 
which  will  reduce  total  reflection  to  a  few  percent.  As  this  will  be  a  significant 
improvement  for  the  LIPPES  measurements,  was  is  the  direction  chosen  for  this  program. 
Assuming  then  a  single  layer,  it  is  readily  shown  that  the  condition  on  the  index  of  the 
coating.  Tic,  for  the  minimum  reflection  is: 


nc  =  (ha  (5 

provided  that  the  coating  has  an  optical  thickness  one  quarter  of  the  light  source  wave 
length,  i.e.: 


ncd  =  Ay4  (6 

where  d  is  the  actual  thickness. 

(At  normal  incidence  the  correction  for  absorption  coefficient  can,  and  will,  be  neglected. 
The  challenge  then,  for  GaAs,  (with  Hs  =  4  at  1056nm)  was  to  specify  and  deposit  a 
dielectric  would  have  an  index  in  the  variable  about  2.00  ±0.10,  that  will  withstand  the 
humidity  and  common  solvent  fumes  of  the  laboratory  and  which  can  be  formed  into 
sputtering  targets  from  available  high  purity  chemicals.^  For  example,  using  GaAs  with 
Hs  «  4  at  527nm,  the  simple  calculation  of  From  various  published  sources  some  AR 
candidates  can  then  be  defined.  For  example,  Si3N4  has  n  =  1.97  to  2.02  when  deposited 
by  sputtering  and  readily  available  in  very  high  purity.  There  is  now  tabulated 


^It  should  be  noted  that  the  added  flexibility  provided  by  double  layer  AR  coating  can  almost  certainly 
provide  R  =  0  for  single  wavelength  sources  at  normal  incidence  [8].  Since  double  layering  allows  both 
thickness  and  index  of  the  AR  films  to  be  varied,  it  is  possible  to  choose  for  the  interface  AR  layer  a  surface 
passivating  compound.  Surface  passivation  will  allow  the  dc.  bias  field  to  be  increased  as  necessary  to 
insure  velocity  of  saturation  of  the  accelerated  charges.  So,  this  research  also  provides  an  excellent 
foundation  for  future  efforts  to  optimize  the  LIPPES  technique  for  airborne  and  other  applications. 
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information  on  a  variety  of  dielectric  compounds  for  which  the  refractive  index  has  been 
measured  at  least  over  some  portion  of  the  visible  and  near  infrared  spectrum  [9].  It  is 
also  possible  to  ‘tailor’  a  coating  for  a  specific  index  values.  Ternary  mixtures  such  as 
silicon  oxynitride,  SiOxNy,  with  1.46  (Si02)  <  nc  ^  2.10  (Si3N4)  and  aluminum 
oxynitride,  AlOxNy  with  1.76  (AI2O3)  <  nc  ^  2.2(A1N)  would  provide  a  composition 
tunable  index  of  high  durability  (hardness  and  moisture  resistance). 


EXPERIMENTAL  APPROACH 

AR  Coating  Preparation: 

All  films  were  prepared  by  reactive  RE  magnetron  sputtering  in  a  SS  chamber  pre  cryo- 
pumped  (15  °K,  closed  cycle  He)  to  P  <  1x10'^  Torr.  After  the  specimens  were 
introduced  into  the  vacuum  chamber  and  prior  to  introduction  of  the  sputtering  gases,  the 
system  was  baked  to  T  >  100°C  to  reduce  the  residual  oxygen  partial  pressure. 
Approximately  12hrs  was  required  for  the  bake  out  cycle.  Zero  grade  argon  and  nitrogen 
were  introduced  through  PE)  gas  flow  controller  and  associated  valves.  Total  pressure  for 
the  film  production  was  5mT  for  the  combined  Ar+N2  gases  (typically  4mT  Ar  and  ImT 
N2  during  the  sputtering  cycle).  The  target  for  all  the  mns  of  this  study  were  polished,  N- 
type  semiconductor  grade,  silicon  wafers  doped  to  IxlO’^  with  phosphorous.  The 
specimens  where  nominally  cleaned  by  rinsing  in  acetone  and/or  methanol  and  blown  dry 
with  N2  before  loading  into  the  UHV  chamber.  The  target  to  specimen  distance  was 
18cm.  All  run  were  done  with  total  RF  power  at  200  Watts. 

In  order  to  develop  a  baseline  for  sputtered  Si3N4  thichness,  calibration  run  were  done 
with  silicon  wafers  (rather  than  GaAs)  as  the  substrate  specimens.  Undoped  silicon 
wafers  were  also  used  as  the  sputtering  target  and  there  was  some  danger  in  sputtering 
through  those  damaging  the  sputtering  gun.  Therefore,  the  depth  of  the  sputtering  craater 
was  closely  monitored  during  the  calibration  runs  to  track  target  oblation.  Typically  three 
runs  were  done  before  the  silicon  target  was  changed. 


FIGURE  2: 
Comparison  of 
reflectivity  of 
GaAs  measured 
for  these  studies 
with  reported 
data  from  the 
literature*. 
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The  data  in  figure  2  shows  the  effect  of  the  N2  partial  pressure  on  the  SixNy 
deposition  rate.  As  predicted  there  is  an  inverse  relationship  of  deposited  rate  with 
increasing  partial  pressure  of  N2  (in  Ar). 

Based  on  the  information  developed  above  the  following  films  were  prepared  to 
specifications  provided  to  us  by  the  Air  Force  scientists  at  Hanscom  AFB; 


Specimen 

number 

Design 
thickness  (A) 

Sputtering  time 
(min) 

Measured 
thickness  (A) 

Index  of  refraction, 
Ha 

5239 

40.0 

1670* 

* 

5240 

938 

20.0. 

945 

1.973 

5241a 

656 

15.5 

693.3  ±  2.6 

1.9471  .001 

5241b 

649 

16.0 

627.8  ±  3.4 

1.9661  .001 

5242 

976 

22.5 

969.4  ±  3.3 

1.9561  .002 

*NOTE:  Ellipsometry  on  specimen  5239  gave  undetermined  results  thus  there  is  no  measurement  for  n. 
The  thickness  value  is  from  Dektak  measurements.  From  this  thickness  an  approximate  sputtering  time  of 
20  min.  was  calculated  for  the  next  run. 


Reflectivity  Measurements: 

Evaluation  of  the  Si3N4  films  as  AR  coatings  was  done  performed  by  measuring 
the  reflectivity  directly  as  a  function  of  wavelength  using  a  LEXEL,  model  480  Avante, 
Ti-  Sapphire  laser  excited  by  a  5  Watt,  CW  SPECTRA  PHYSICS,  Millennia,  diode 
pumped,  YAG  laser  (frequency  doubled  to  532nm).  The  LEXEL  was  fitted  with 
dielectric  cavity  mirrors  optimized  for  a  wavelength  range  of  700  <X,<  850  nm  and  was 
wavelength  tunable  via  a  rotatable  bi-refringent  crystal.  A  dielectric  mirror  reflected  5% 
of  the  extant  light  into  a  NEW  FOCUS,  Inc.  Model  7711  Fuzeau  wavelength  meter  to 
monitor  the  X  parameter.  Approximately  8%  of  the  remaining  power  was  reflected,  with 
a  microscope  slide,  to  the  detector  element  of  a  OPHIR  Laser  Power  Meter  used  to 
monitor  the  incident  power.^  The  remaining  light  passed  through  the  slide  and,  after 
reflecting  form  the  specimen  under  test,  reflected  to  a  MOLECTRON  EPMIOOO 
energy/power  meter  fitted  with  a  J3  detector.  A  program  was  developed  using  HP-VEE 
to  acquire  data  from  the  individual  measuring  devices. 

The  procedure  for  obtaining  the  data  was  to  adjust  the  bi-refringent  crystal  for  a 
specific  wavelength  and  then  run  the  program  which  took  ten  readings  at  each  X  setting. 
The  data  was  stored  in  tabular  form  and  later  averaged  to  provide  incident  power, 
reflected  power  and  reflectivity  (Preflected/Pincident)  as  a  function  of  X  for  the  plots  below. 

Before  each  specimen  was  measured,  a  first  surface  mirror  was  placed  at  the 
specimen  position,  the  two  power  meters  were  ‘zeroed’  with  the  laser  blocked  and  the 
room  lights  off,  and  then  the  laser  was  turned  on  and  the  ratio  of  incident  to  reflected 
power  was  measured.  This  number  was  then  use  as  the  ‘100%  reflectivity’  value  for 
adjusting  the  subsequent  measurements  with  the  specimen  in  place. 


^  The  microscope  slide  had  essentially  ‘flat’  transmission  over  the  X  range  used  here  and  so  no  correction 
was  made  for  its  presence. 


7-7 


RESULTS 

Before  the  specimens  produces  in  the  facilities  at  Brown  University  were  evaluate 
the  reflectivity  setup,  described  above,  was  used  to  measure  a  specimen  of  un-coated 
GaAs  of  the  type  that  was  intended  for  AR  coating.  This  was  done  after  successively 
cleaning  in  e-grade  acetone,  e-grade  methanol  and  etching  in  HF;NH30H,  rinsing  in  DI- 
water  and  finally  blow  drying  in  dried  zero  grade  nitrogen.  The  specimen  was  measured 
as  described  above  and  then  the  plot  of  reflectivity  versus  wavelength  was  generated  as 


FIGURE  3: 
Comparison  of 
reflectivity  of 
GaAs  measured 
for  these  studies 
with  reported 
data  from  the 
literature*. 


shown  in  figure  3. 
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Added  to  that  plot  is  the  data  of  Aspenes  and  Studa*  which  was  made  on  vacuum  cleaved 
GaAs  at  IxlO'^T  and  is  the  most  accurate  available  for  GaAs.  As  can  be  seen  in  the 
figure,  the  data  for  this  study  tracks  well  the  published  data  except  for  small  variations  in 
the  region  around  750nm.  Based  on  the  good  agreement  here  the  system  was  applied  to 
the  AR  coated  specimen  in  the  same  fashion. 

The  plot  of  figure  4  shows  the  reflectivity  of  the  167nm  coated  specimen(#5239).  While 
the  167nm  AR  coating  was  prepared  primarily  as  thick  layer  on  the  first  GaAs  specimen 
for  reference  purposes.  When  it  was  measured  for  reflectivity,  it  was  found  to  approach  a 
minimum  at  in  the  vicinity  of  760nm.  This  is  in  agreement  with  the  predicted  minimum 
X  for  a  film  of  167nm  thickness  which  should  occur  at  an  odd  multiple  of  1/4 A,c,  i.e.  the 
wavelength  corrected  for  the  index  of  the  AR  coating.  (In  this  case  N=17  would  imply  a 
minimum  for  161nm  thickness).  As  seen  in  the  figure  the  reflectivity  not  only  is 
approaching  a  minimum  but  also  has  an  extremely  low  absolute  reflectivity  on  the  order 
of  0.5% !  This  means  that  thicker  layer  are  acceptable  and  might  even  have  some 
advantages  over  the  first  quarter  X  thickness  due  to  better  adhesion.  There  are  two 
separate  series  of  data  plotted  in  figure  4. 
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FIGURE  4: 
Wavelength 
dependence  of 
Si3N4  nims 
reactively 
sputtered  on  to 
GaAs  specimen 
#5239. 


Series  1  was  done  with  the  small  amount  of  532nm  light  that  bleeds  through  the  optical 
path  and  originates  from  the  diode/YAG  laser  used  to  excite  the  LEXEL  Ti-Sapphire 
system.  The  series  2  data  was  a  re-run  of  the  same  #5329  specimen  done  after  filters  had 
been  added  to  remove  that  532nm  component.  As  can  be  seen,  the  reflectivity  is  further 
reduced  when  the  stray  background  component  is  removed. 

Of  the  three  remaining  specimens  listed  in  the  table  above,  #5241  a  was  not 
measured  because  the  film  had  clearly  delaminated  for  the  GaAs  substrate  leaving  a 
mottled  surface  appearance. 

AR  coating  5421b  had  a  design  thickness  of  64.9  nm  chosen  to  minimize  reflection 
at  526nm  which  is  X/2  of  one  of  the  laser  lines  intended  for  use  as  in  the  LIPPES  system. 


Figure  5: 

Reflectivity  versus 
wavelength  for 
specimen  #2541b 
with  64nm  AR 
coating.  Specimen 
optimized  for 
minimum  X 
reflectivity  at  526  nm. 


The  incident  and  reflected  power  data  and  the  resultant  reflectivity  are  plotted  for 
that  specimen  in  figure  5.  A  polynomial  curve  fit  is  has  been  added  for  the  reflectivity 
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data.  Extrapolation  of  that  equation  to  shorter  wavelengths  give  a  minimum  reflectivity  at 
670nm  which  is  about  a  21%  error  but  with  an  absolute  reflectivity  on  the  order  of  5%. 

A  similar  plot  is  shown  in  figure  6  for  AR  coating  #5242  with  a  97.6  nm  design 
thickness.  Here  again  the  original  incident  and  reflected  powers  are  plotted  along  with  the 
resulting  ratio  (Pi/Pr)  to  give  the  reflectivity.  The  polynomial  curve  fit  for  that  curve  has 


Figure  6: 

Reflectivity  versus 
wavelength  for 
specimen  #5242 
with  97.6nm  AR 
coating.  Specimen 
optimized  for 
minimum  "k 
reflectivity  at  800 
nm. 


appears  to  be  approaching  a  minimum  in  the  vicinity  of  850  nm  which  implies  an  error  on 
the  order  of  6%.  Since  leveling  off  of  the  curve  is  not  observed  here  the  error  is  likely 
greater  but  the  reflectivity  at  minimum  will  clearly  be  less  than  10%  at  the  minimum.  The 
main  source  of  disagreement  between  the  predicted  minima  and  that  indicated  by  the  data 
is  probably  the  relatively  large  area  covered  by  the  reflectivity  probe  beam.  This  allows 
the  sampling  of  areas  were  there  is  gradients  in  the  thickness  and  hence  strong  variations 
in  the  reflectivity  values.  Time  constraints  did  not  permit  the  investigation  of  that 
explanation,  but  experiments  are  continuing  at  the  AF  laboratory  to  evaluate  that  effect. 
In  general  we  have  shown  that  Si3N4  reactively  sputtered  films  can  be  varied  in  thickness 
to  provide  reasonable  matches  to  specific  wavelengths.  Refinements  of  this  process 
should  provide  repeatability  to  within  a  few  percent  in  A,  with  minima  in  reflectivity  loses 
less  than  5%.  Broader  band  light  sources  will  allow  a  more  accurate  determination  of  the 
values  at  the  reflectivity  minima. 

CONCLUSION 

We  have  developed  a  procedure  for  reactively  sputtering  continuous  Si3N4  thin 
film  anti  reflection  coating  onto  GaAs  for  the  purpose  of  enhancing  the  transmission  of 
certain  discrete  laser  wavelengths.  Independent  measurements  of  absolute  reflectivity 
show  that  those  AR  coatings  reduce  the  reflective  losses  to  <  0.05%  and  that  the  process 
control  is  sufficiently  precise  to  allow  the  film  thickness  to  be  controlled  to  within 
±10nm.  This  technology  was  transferred  to  the  scientists  at  the  AFOSR  Sensors 
Technology  Division  who  were  able  to  reproduce  the  film  characteristics  and  are  adapting 
the  process  for  their  particular  specimens,  substrates  and  apparati. 
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DEVELOPMENT  OF  A  SIMULATION  MODEL 
FOR  DETERMINING  THE  PRECISION  OF  RELIABILITY 
PREDICTIONS  AND  ASSESSMENTS 


Digen  K.  Das 
Associate  Professor 

Department  of  Mechanical  Engineering  Technology 
SUNY  Institute  of  Technology 


Abstract 

Simulation  models  for  determining  the  precision  of  reliability  predictions  for  series,  parallel 
and  combination  systems  were  studied  by  applying  both  the  probability  and  possibility  theories. 
The  method  of  bounds  and  fuz2y  number  technique  were  used  to  determine  the  precision  of  the 
reliability  predications.  Available  field  data  for  capacitors  (commercial  and  military)  were  used 
for  all  computational  results. 
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DEVELOPMENT  OF  A  SIMULATION  MODEL 
FOR  DETERMINING  THE  PRECISION  OF  RELIABILITY 
PREDICTIONS  AND  ASSESSMENTS 


Digen  K.  Das 


Introduction 

It  is  well  known  that  there  exists  a  significant  difference  between  predicted  reliability  and 
actual  field  reliability  of  systems.  This  is  often  termed  as  “Reliability  Delta”  and  has  been  a  long 
standing  problem  in  reliability  engineering.  Usually  system  reliability  is  derived  from  the 
knowledge  of  component  reliability  and  the  concepts  of  statistical  life-time  distributions. 
However,  in  practice  this  knowledge  does  not  necessarily  provdde  insight  into  how  a  single 
component  will  behave  in  a  system.  As  a  consequence,  systems  all  too  often  achieve  reliabilities 
markedly  different  (usually  lower)  than  those  predicted. 

In  response  to  this  long  standing  problem,  the  Air  Force  Research  Lab  initiated  the  “New 
System  Reliability  Assessment  Methods”  program  (Ref  1,  2).  The  objective  of  this  effort  was  to 
develop  a  system  reliability  methodology  that  accounts  for  all  predominant  factors  that  affect  field 
reliability  of  systems.  The  performing  organizations  in  this  program  were  IIT  Research 
Institute/Reliability  Analysis  Center  (IITRI/RAC)  and  Performance  Technology. 

The  Summer  Faculty  Research  Program  (Ref  3)  was  designed  to  be  a  complementary 
effort  to  the  on-going  “New  System  Reliability  Assessment  Methods”  program.  The  objective  of 
the  Summer  Faculty  Research  Program  was  to  initiate  the  development  of  practical  methods  for 
determining  the  accuracy  of  reliability  predictions  and  assessments.  It  was  established  that  the 
models  for  determining  the  precision  of  reliability  should  be  based  on  both  the  probability  and 
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possibility  theories.  This  Summer  Research  Extension  Program  addresses  this  research  topic. 

The  organizational  structure  of  most  systems  can  be  described  as  a  series  system,  a  parallel 
system  or  a  combination  thereof  The  mathematical  models  for  determining  the  reliability  of  these 
systems  are  described  in  the  following  sections. 

Mathematical  Models  [Tasks  1  and  2] 

Series  System 

The  mathematical  reliability  model  for  a  simple  series  system  is  the  product  of  the 
individual  reliabilities  (Ref  4).  Figure  1  depicts  such  a  series  system  with  n  elements.  It  is 
assumed  that  each  element  (capacitor)  is  independent-that  the  success  or  failure  of  one  element 
does  not  affect  the  success  or  failure  of  any  other  element. 


Fig.  1  Logic  diagram  of  a  simple  series  system  of  n  elements. 

The  model  is  given  by  the  following  equation. 

Rt  =  Ri  X  Rj  X  R3  X  ...  X  Rj  X  ...  X  R„  (1) 

where  Rj  =  the  total  reliability 

R,  =  the  reliability  of  the  first  unit 
Rj  =  the  reliability  of  the  ith  unit 
R„  =  the  reliability  of  the  nth  (last)  unit 

This  can  be  written  in  a  more  condensed  form 


RT  =  nRi  (la) 

1 
n 

where  Yi  Rj  means  the  product  of  all  Rj  from  1  through  n. 
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Here  Rj  can  be  calculated  from  the  following  relation: 


Ri  =  e*Xi  tj  = 

where  A.;  =  constant  failure  rate 
t;  =  mission  operating  time 

Therefore, 

Rj  X  Rj  X  R3  X ...  X  Rj  X  ...  X  R„ 

=  e'^'fX  e'^'jX  e’^3  x  ...  x  e’*"}  x  ...  x  e'*'^ 

R  =e'^+*'+’'+  +•"+  +*" 

rv-T  e  1  T  2^  3  ^  ^  i  ^  ^  n) 

n 

Or  Rj  =  exp^-Yfi  j  (2) 

when  tj  =  tj  =  ...  t„ ,  then  Eq.  (2)  can  be  written  as 

n 

RT  =  exp/-t5^Ai\  (2a) 

'  i=l  ' 

Mean  Time  to  Failure  fMTTF') 

In  a  constant-failure-rate  series  system,  the  mean  time  to  failure  m  is  the  reciprocal  of  the 
system  failure  rate  X.  The  reliability  of  a  component  is  its  probability  of  survival.  If  a  large 
number  of  components  are  put  on  test,  then  the  reliability  at  any  time  t  is  equal  to  the  ratio  of  the 
number  of  units  still  operating  at  that  time  (surviving).  Ng,  divided  by  the  initial  total  number  Nj, 
as  given  by  Eq.  3. 


R(t)  =Ns 
Nt 


(3) 
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Since  the  number  of  units  still  operating  Nj  is  equal  to  the  total  number  minus  the  number 
which  have  failed  Np,  Eq.  (3)  may  be  rewritten  as 

R(t)  =  Ns  =  Nt^_=  1  -  Np  (4) 

Np  Np  Np 

The  total  number  of  units  Np  is  constant  while  the  number  which  have  failed  increases  with  time. 
Hence,  we  can  write: 

dR  =  da-Np^p}  =  -l  ^  (5) 

dt  dt  Np  dt 

By  rearranging  the  terms  in  Eq.  (5)  we  obtain 

1  ^  =  -  dR  (6) 

Np  dt  dt 

In  Eq.  (6),  ‘‘^F/dt  is  the  frequency  at  which  failures  occur  when  the  total  number  of  units 

Np  remains  constant  (no  replacement  of  failed  units.)  When  plotted  on  a  graph  as  a  function  of 

time  t,  we  obtain  the  time  distribution  of  failures.  If  we  divide  ‘‘^F/dt  by  the  initial  total  number  of 

units  Np,  we  obtain  the  distribution  of  failures,  or  failure  frequency  curve,  per  component.  Such  a 

unit  failure  distribution  curve  is  called  a  failure  density  function,  of  simply  f(t).  Substituting  this 

term  into  Eq.  (6),  we  obtain 

f(t)  =  -dR  (7) 

dt 

Equation  (7)  applies  to  all  possible  failure  density  functions  and  not  just  to  the  case  of  a  constant 
failure  rate. 

Now,  the  mean  time  to  failure  m,  like  any  mean  value,  is  the  first  moment  about  the  origin 
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of  the  parameter  being  considered.  In  this  case  it  is  the  average  time  at  which  failure  occurs  and 
can  be  found  by  operating  all  units  to  failure,  summing  the  times  to  failure,  and  dividing  by  the 
number  of  units. 

n  n 

m=  times  to  failure  =  (8) 

1  1 

ii  N-r 

where  Enr  =  Nj.  Equation  (8)  defines  the  average  time  to  failure  for  one  component.  In  the 
limit,  as  the  number  of  units  becomes  infinitely  large,  the  summation  process  becomes  an 
integration  process.  Also,  the  distribution  of  failures  per  component  Nf/Nj  becomes  defined  by 
the  failure  density  function  (d  Np/dt)  Nj,  and  m  becomes  the  integral  of  the  product  of  the  density 
function  and  time: 

oo 

m  =  J  t  f(t)  dt  (9) 

0 

Equation  (9),  like  Eq.  (7),  is  applicable  to  all  possible  density  functions.  Since  f(t)  =  -dR/dt  [from 
Eq.  (7)],  Eq.  (9)  can  be  rewritten 


Integrating  Eq.  (10)  by  parts,  we  obtain 


8-7 


We  can  show  that  the  first  term  on  the  right  side  of  Eq.  (1 1)  is  zero.  (Details  not  included) 


Therefore,  Eq.  (1 1)  reduces  to 


m  = 


f 


Rdi 


(12) 


This  means  that,  in  the  general  case,  the  mean  time  to  failure  can  be  obtained  by  integrating  the 
reliability  function  over  the  time  interval  0  to  <».  Hence,  for  the  specific  case  of  a  constant  failure- 
rate  component,  we  can  write: 


R  =i  ejq} 


exp  I -A-  I  dt]  =  exp  (-A;/) 


which,  of  course,  we  already  know.  From  Eq.  (12), 


m  = 


Rdt  =  I  e  *‘*‘dt  =  j 

Jo  Jo  ■ 

-i[e-“  -eO]  =  -1(0  -  1)  =  1 


1- 


1- 


(13) 


In  a  series  system,  X  is  merely  the  sum  of  the  failure  rates  of  the  individual  components,  so 


that 


Parallel  Systems 

Two-Unit  Parallel  System  -  One  Required 


(13a) 


Fig.  2  Two-Unit  Parallel  System 

In  a  simple  parallel  system  consisting  of  two  units  in  parallel  with  both  operating  but  only 
one  required,  both  must  fail  for  the  system  to  fail.  The  probability  of  the  first  unit  failing  is  Q, 
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In  the  case  where  A,  =  Xj ,  Eq.  (19)  reduces  to 


ni  =  i  +  i-  i=  3 

A  A  2A  2  A  (19a) 


Fig.  3  Three-Unit  Parallel  System 

When  there  are  three  units  in  parallel  and  only  one  is  required  as  in  Fig.  3,  all  three  must 
fail  for  the  system  to  fail,  and  the  probability  of  system  failure  is 

-or£«iVe3 .  (i  -  /"') .  (i  -  X  (i  -  /"*) 
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This  expands  to 


-Pi 
1  -  e  ‘ 


-F  -F 
-  e  ^  -  e  ^  + 


/  -Fj  -F  ' 
+  ( e  ^  X  e  ^ 


The  reliability  is,  of  course,  one  minus  the  failure  probability  and  can  be  written 


-F.  -F,  -F,  -(F, +F,)  -r^F.+F,)  “ 

‘  +  e  ^  +  e  ^  -  e  ‘  ^  c 

^-<F2  +  ) 


This,  in  turn,  can  be  expressed  as 


Rrp  =  R-^  +  ^2  ^3  “  X  ^2^  “  9^ 

-  (R^  X  Rg)  + 


In  terms  of  failure  probabilities, 

Rt=1-(Q,xQ2  xQj)  (22) 

Equations  (20),  (21),  and  (22)  for  three  units  in  parallel  with  only  one  required  reduce  to 
Eqs.  (20a),  (21a)  and  (22a),  when  the  three  units  are  alike. 


RT  =  3e-''-3e^+  e'^*^ 
Rt  =  3R-3R^+  R^ 
Rt  =  1  -  Q' 


(20a) 

(21a) 

(22a) 


For  a  three-unit  parallel  system  where  only  one  is  required  to  operate,  the  MTTF  is 
computed  as  follows.  In  the  general  case  when  the  failure  rates  are  different,  Eq.  (20)  is  used  in 
the  integration  with  At  used  rather  than  F. 

1 - — — 

A,  +  A2  Aj  +  A3 


A2  +  A3  A^  +  A2  +  A3 

When  the  failure  rates  are  the  same,  Eq.  (20a)  is  used. 
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(23a) 


m  =  3-3+  i=  18  -  9+  2=  11 
1  21  31  61  61  61  61 


Multiple-Unit  Parallel  System  -  One  Required 


Fig.  4  Multiple-Unit  Parallel  System 


In  the  large  majority  of  cases,  when  there  are  more  than  three  units  in  parallel,  all  units  are 
the  same.  Consequently  the  following  derivations  consider  only  cases  of  identical  units. 

The  equations  for  the  reliability  and  mean  time  to  failure  for  additional  redundancy  when 
only  one  operable  unit  is  required  can  be  found  by  expanding  the  derivations  for  Eqs.  (20) 
through  (23a)  to  cover  the  additional  units.  In  the  equations,  n  is  the  total  number  of  units  and 
n-1  fmlures  are  permitted.  The  three  equivalent  reliability  expressions  are 

Rf  =  ne'*"  -  n(n-l)  e'^  +  nfn-l)  (n-2)  e’^*" 

2  2x3 

-  n(n-n  (n-2Vn-3)  e""^  +  ...  (±)  e^^ 

2x3x4  (24) 

Rt  =  nR  -  nrn-n  R^  +  n^n-U  (n-2)  R^ 

2  3! 

-  n(n-\)  (n-2¥n-3^  K*  +  ...  (±)  R" 

4!  (25) 
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Rt=i-Q" 


(26) 


The  mean  time  of  failure,  derived  from  the  complete  expansion  of  Eq.  (24)  after  integrating, 
combining  terms,  and  simplifying  is  found  to  be 
m  =  i  +  l+  i  +  i  +...+  1 

X  2X  3X  4X  nA  (27) 

It  can  be  seen  from  the  preceding  equations  [and  from  Eq.  (26)  in  particular]  that  as  more 
units  are  added  in  parallel,  the  total  reliability  rapidly  approaches  unity,  differing  only  by  Q". 
However,  as  shown  by  Eq.  (27),  m  does  not  increase  correspondingly  and  each  additional  parallel 
redundant  unit  contributes  less  to  the  system  MTTF  than  did  its  predecessor. 

Combination  Systems 

The  organization  structure  of  most  systems  can  be  described  as  a  combination  of  series 
and  parallel  systems.  The  logic  diagram  of  such  a  system  is  shown  in  Fig.  5. 


Fig.  5  Logic  Diagram  of  a  Combination  System 
The  precision  of  reliability  prediction  for  these  types  of  combination  system  can  be 
determined  by  the  application  of  Limiting  Value  technique  (Method  of  Bounds).  The  model 
determines  the  limiting  values  (upper  and  lower  bounds)  of  reliability  of  the  system  and  obtains  a 
single  system  reliability  prediction  value,  utilizing  the  upper  and  lower  bounds.  The  model  is 
described  below: 
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Calculation  of  the  Upper  Bound 


The  calculation  of  the  upper  bound  considers  only  those  elements  failures  which 
individually  would  cause  mission  failure.  Hence,  only  those  blocks  in  the  logic  diagram  which  are 
in  series  are  considered.  Generally,  this  is  sufficient  to  provide  a  satisfactory  estimate.  However, 
if  the  reliabilities  associated  with  individual  blocks  are  not  very  high,  it  may  be  necessary  to 
consider  system  failures  resulting  fi-om  multiple  component  failures,  i.e.  to  consider  the  parallel 
blocks.  When  only  the  series  elements  are  considered,  the  system  reliability  upper  bound  is 
the  product  of  the  reliabilities  of  elements  A  and  B. 

Rupper  “  ^  ^ 

Assuming  an  exponential  model  (constant-failure-rate  system),  the  reliabilities  of  the  components 
can  be  expressed  as: 

where  and  Fg  are  the  mission  failure  rates  of  elements  A  and  B  respectively.  Therefore, 

Rapper  =  e-^A  X  =  e-<^A  +  ^ A) 

In  general,  the  upper  bound  can  be  calculated  using  the  following  equation 


where  is  the  mission  failure  rate  of  series  element  i,  and  m  is  the  number  of  series  elements. 

It  is  seldom  necessary  to  consider  parallel  elements.  However,  when  the  reliabilities  of 
some  elements  are  relatively  low  (e.g.,  less  than  0.99),  and  these  are  functionally  in  parallel,  the 
system  reliability  upper  bound,  if  only  series  elements  are  considered,  may  be  overly  optimistic.  It 
can  be  lowered  by  considering  multiple  failures.  In  Figure  5,  failure  of  any  of  the  following  pairs 
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of  elements  results  in  mission  failure:  C  and  E,  C  and  F,  D  and  E,  D  and  F,  or  G  and  H.  If  these 
components  are  not  highly  reliable,  there  may  be  more  than  a  negligible  chance  of  both  elements 
in  a  pair  failing.  In  that  case,  two  elements  at  a  time  should  be  considered. 

When  system  failure  results  from  failure  of  two  parallel  elements,  all  series  elements  must 
be  good  (otherwise  the  system  would  have  failed  due  to  the  failure  of  a  series  element).  The 
series  elements  must  therefore  be  included  in  these  calculations.  Hence,  system  failure  probability 
resulting  from  failure,  for  example,  of  elements  C  and  E  in  the  preceding  logic  diagram  is  defined 
as  X  Rg  X  Qc  X  Qg,  where  Q  =  1  -  R  Similar  expressions  apply  to  the  other  pairs  of  parallel 
elements  enumerated  above. 

®-A  ^  ^  ^  Qc  ^  Qf  ^  ^  ^  Qd  ^  Qe 

R-a  ^  ^  Qd  ^  Of  ®-a  ^  *  Qo  ^  Qh 

The  probability  of  system  failure  resulting  from  the  failure  of  two  non-series  elements  is 
found  by  adding  these  terms. 

P  =  (Ra  X  Rb  X  Qc  X  Qe)  +  (Ra  X  Rb  X  Qc  X  Qf)  +  ... 

+  (RaXRbxQgxQh) 

This  can  be  simplified  to 

P  =  Ra  X  Rb  X  [(Qc  X  Qe)  +  (Qc  x  Qf)  +  +  (Qo  x  Qh)] 

Since  R^  x  Rb  =  e'^'i ,  from  Eq.  (28),  we  can  write  this  as 


where  m  is  the  number  of  series  elements,  Qj.  and  Q,..  are  the  failure  probabilities  of  pairs  of 
parallel  elements  which  together  cause  system  failure,  and  x  is  the  number  of  such  pairs.  This 
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value  is  subtracted  from  Eq.  (28)  to  obtain  the  system  reliability  upper  bound  when  two  failures 
are  considered: 


Calculation  of  the  Lower  Bound 

The  lower  bound  is  found  by  adding  probabilities  of  success  cases.  In  redundant  systems, 
there  are  usually  many  cases  where  one  or  more  element  failures  can  occur  without  causing 
system  failure.  Therefore,  it  is  always  necessary  when  calculating  the  lower  reliability  bound  of  a 
redundant  system  to  consider  all  cases  with  one  failure,  and  frequently  cases  with  two  or  even 
three  failures. 

The  first  calculation  considers  the  case  of  no  failures,  that  is,  where  all  elements  are  good. 
This  is  simply  the  product  of  the  reliabilities  of  all  elements, 

n 

^^tower  ~  n  ^ 

J=1 

where  is  the  reliability  of  any  element,  series  or  parallel,  and  n  is  the  total  number  of  elements. 
In  the  previous  logic  diagram  of  Fig.  5,  the  first  calculation  of  the  lower  bound  is  equal  to  x 

X  Rc  X  ...  X  Rh- 

=  X  e\x  e-'^c  X  ...  X  e’^H  =  exp 
The  general  case  for  zero  failures  is  expressed  as  follows: 


A  success  is  also  achieved  if  one  non-series  element  fails  and  all  other  elements  are  good. 
Thus,  a  failure  of  any  one  element,  C,  D,  E,  F,  G,  or  H  is  still  a  system  success.  These  cases  are 
listed  here  for  clarity,  with  R  denoting  success  and  Q  denoting  failure. 
(RaxRbxQcxRoxRexRfxRoxRh) 

+  . 


+  (Ra  xRbxRcxRdX^xRjjxRoX  Qh) 


These  expressions  can  be  simplified  by  multiplying  by  appropriate  terms  equal  to  unity  and  then 
simplifying,  as  follows:  xRf  xRg  xR;^) 


BpxR„xR„j 


'  •«-  i'. 


I  ^  ^  ^  ~  X  Rfi  X  Rp.  X  Rq  X  Ri 


X  Rq  X  Rg  X  Rp 


^h\ 

xR,x«„x-j 


f  ‘■ipoP  ■  ^ yr* '  ’  ’  *• 

X  Rfi  X  Rq  X  Rq  X  /?£.  X  Rp  X  Rq  X  Rfw  X 


A'! ■ 


^  (  X  R^j  X  Rjg  X  Rp.  X  Rq  X  Rp  X 


^  ^  ^  Rp  ^  Rq  X  Rff  X  ^ 

igL  ■  ‘  ^H/ 

X  Rq  X  Rjd  X  Rg  X  Rp  X  Rq  X  Rg) 


X  I  - + - +  .  .  .  +  - 
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The  first  term  is  simply  the  probability  of  all  elements  being  good,  as  expressed  in  Eq.  (31).  The 
second  term  is  the  sum  of  the  ratios  of  failure  probability  to  success  probability  of  all  non-series 
elements.  The  product  of  these  two  terms  can  be  written  as 


=  exp 


-iMWi 


where  (5kis  the  probability  of  failure  of  a  non-series  element,  is  the  reliability  of  the  non-series 
element,  and  p  is  the  number  of  non-series  elements  of  the  system.  The  equation  for  cases  of  two 
non-series  element  failures  which  together  do  not  cause  system  failure  is  similarly  derived; 


where  k  and  i  are  pairs  of  non-series  elements  which,  failing  together,  do  not  cause  system  failure 
and  p’  is  the  number  of  such  pairs.  Equations  (3 1),  (32),  and  (33)  are  then  added  to  give  the 
lower-bound  probability  of  success  considering  no  element  failures,  one  element  failure,  and  two 


element  failures. 
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Final  Combined  Prediction 


The  last  step  is  the  combining  of  the  two  bounds  to  obtain  a  single  system  reliability 
prediction  value.  The  easiest  method  would  be  to  take  the  simple  arithmetic  average  of  the  two 
bounds.  However,  experience  has  shown  that  this  results  in  an  overly  pessimistic  value.  It  has 
been  found  empirically  that  the  true  value  of  the  system  failure  probability  can  be  more  closely 
approximated  by  taking  the  square  root  of  the  product  of  the  unreliabilities  associated  with  the 
upper  and  lower  reliability  bounds.  This  is  then  subtracted  from  unity  to  obtain  the  single 
prediction  value. 


upper' 


R 


lower  ^ 


(35) 


It  is  important  that  the  calculations  of  the  two  bounds  be  stopped  at  the  same  point.  That  is,  if 
only  cases  of  one  failure  are  considered  for  the  upper  bound,  then  cases  of  only  zero  and  one 
failure  should  be  considered  for  the  lower  bound;  if  two  failures  are  considered  for  the  upper 
bound,  then  consider  cases  of  two  failures  for  the  lower  bound,  etc.  Inaccuracies  will  result  in  Eq. 
(35),  if  the  calculations  are  not  carried  to  the  same  point. 

Results  and  Discussions 

The  computational  data  used  in  this  investigation  for  the  models  described  in  the  previous 
section  were  made  available  by  the  Reliability  Analysis  Center  (RAC),  Rome,  N.Y.  The  data  used 
are  presented  in  the  appendix.  Two  types  of  capacitors  were  used  and  they  are  listed  below. 
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Table  1 


Capacitor  Component  Reliability  Results 

Part 

Number 

Part  Description 

Quality  Level 

Application 

Environment 

Data  Source 

Reliability 

1 

Fixed,  electrolytic, 
tantalum 

Commercial 

GB 

13567-021 

0.9947 

2 

Fbced,  electrolytic, 
tantalum 

Military 

GF 

14851-000 

0.9938 

It  may  be  noted  that  because  of  the  limited  nature  of  the  available  field  data,  the 


component  reliability  results  shown  in  the  table  should  be  treated  with  caution.  A  degree  of 

uncertainty  exists  in  the  data  used  for  the  computation.  The  reliability  results  for  various  forms  of 

series  and  parallel  systems  are  shown  in  Table  2.  The  results  indicate  that,  as  expected,  the 

reliability  of  the  parallel  system  rapidly  approaches  unity,  when  more  components  are  added. 

The  reliability  and  bounds  for  the  combination  (series  and  parallel)  system  are  presented  in 

Table  3.  It  is  also  pictorially  shown  in  Fig.  6. 

1 


^Hjppcr 

l^yslem  ' 
F.lowcr  - 


Cd, 

15 

.2 

"5 

cti. 


Upper  bound 
Single  prediction 
Lower  bound 


0  I— - - - 

Fig.  6  Upper  and  lower  bounds  for  the  combination  system 
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Here  the  equations  of  the  mathematical  models  are  precise;  however,  the  type  of  results 
shown  is  always  questionable  because  of  the  inherent  uncertainty  of  the  data  used.  This  well 
known  problem  in  reliability  engineering  was  addressed  by  Zadeh  (Ref  5,  6),  Dempster  (Ref  8) 
and  Shafer  (Ref  9).  Their  work  in  general  is  known  as  possibility  theories.  An  attempt  has  been 
made  to  apply  this  new  theory  in  the  current  investigation.  The  details  of  this  endeavor  are 
described  in  the  following  section. 


Table  2 


Reliabilities  for  Series  and  Parallel  Systems 

Figure 

Numbers 

System 

No.  of 
Components 

Part 

Number 

Reliability 

1 

Series 

7 

1 

0.9635 

1 

Series 

7 

2 

0.9574 

2 

Parallel  -  one  required 

2 

1 

0.99997 

2 

Parallel  -  one  required 

2 

2 

0.99996 

3 

Parallel  -  one  required 

3 

1 

0.99999 

3 

Parallel  -  one  required 

3 

2 

0.99999 

4 

Parallel  -  one  required 

7 

1 

0.99999 

4 

Parallel  -  one  required 

7 

2 

0.99999 
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Table  3 


Reliabilities  and  Bounds  for  Combination  (Series  ife  Paralld)  System 

Number  of  components  =  8  (Fig.  5) 

Case  I:  All  components  of  the  system  are  good. 

Part  Number 

Upper  Bound 

Lower  Bound  Rk^er 

Reliability  R^,e„, 

1 

0.9894 

0.9584 

0.9790 

2 

0.9876 

0.9515 

0.9755 

Case  II;  One  non-series  component  fails  and  all  other  components  are  good. 

Part  Number 

l^upper 

^iower 

1 

0.9894 

0.9890 

0.9892 

2 

0.9876 

0.9871 

0.9874 

The  Possibility  Theory  (Task  3) 

In  the  mathematical  models  described  in  the  previous  section  for  the  determination  of 
reliabilities  and  their  precision,  it  should  be  noted  that  although  the  equations  are  precise,  the 
statistical  data  used  in  the  computation  seldom  are.  A  consequence  of  this  is  the  inherent 
uncertainty  in  all  reliability  results.  The  uncertainty  is  mainly  due  to  the  fact  that  failures  being 
relatively  rare  events  (typically  only  a  few  per  million  hours  of  operation),  collecting  enough  data 
on  which  to  base  a  statistical  “probability  of  failure”  is  a  costly  and  difficult  undertaking,  and  the 
relevance  of  the  data  to  any  particular  system,  as  well  as  its  validity,  is  often  questionable.  Further 
extrapolating  these  failure  probabilities  through  statistical  methods  to  calculate  a  system  level 
reliability  only  increases  the  uncertainty. 

This  led  to  the  development  of  the  possibility  theories,  commonly  know  as  “Fuzzy  Sets” 
theory  (FST)  and  Evidence  Theory  (ET).  The  Fuzzy  Set  Theory  was  originally  presented  by 


8-22 


Zadeh  (Ref.  5,  6)  and  provides  the  basis  of  a  possibilities  approach  to  system  reliability  evaluation 
based  on  the  premise  that  a  small  probability  does  not  always  mean  a  low  possibility  of  an  event, 
whereas  a  low  possibility  would  necessarily  imply  a  low  probability  (Ref  7).  FST  may  be  thought 
of  as  a  generalized  form  of  the  conventional  Boolean  Set  Theory.  The  difference  is  that  an  object 
can  have  a  Fuzzy  Set  Membership,  p(x),  anywhere  in  the  continuous  range  of  0  to  1,  but  for 
Boolean  Sets,  membership  is  restricted  to  values  exactly  equal  to  0  or  1. 

The  ENddence  Theory  (ET)  originated  from  the  work  of  Dempster  (Ref  8)  and  was 
developed  by  Shafer  (Ref  9).  It  is  a  new  took  for  representing  situations  in  which  various  kinds 
of  ignorance  exist  in  our  knowledge  or  information  about  a  system.  It  is  a  reasoning  approach  for 
testing  multiple  hypothesis  on  the  basis  of  evidence.  The  only  assumption  made  about  the 
evidence  is  that  the  sum  of  the  values  supporting  all  conclusions  plus  the  unknown  equals  1. 
Evidence  from  multiple  sources  is  combined  by  a  geometric  procedure  that  generates  the  mass  of 
evidence  supporting  each  possible  conclusion. 

Fuzzy  Numbers  and  the  Relevant  Arithmetic 

The  concept  of  uncertain  or  fuzzy  numbers  may  be  considered  as  an  extension  of  the 
concept  of  the  Interval  of  confidence.  This  extension  is  based  on  a  natural  and  very  simple  idea. 
Instead  of  considering  the  interval  of  confidence  at  one  unique  level,  it  is  considered  at  several 
levels  and  more  generally  at  all  levels  from  0  to  1  (Ref  10).  Here  we  consider  the  maximum  of 
presumption  to  be  at  level  1  and  the  minimum  of  presumption  to  be  at  level  0.  It  should  be  noted 
that  the  fuzzy  numbers  are  not  random  numbers  (more  commonly  known  as  Random  Variables). 
Uncertainty  and  randomness  are  two  very  different  and  important  concepts.  They  can  be  used 
together  but  should  not  be  confused. 
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Fuzzy  numbers  are  numerical  approximations  such  as  “about  5".  For  simplicity  fiizzy 
numbers  are  often  represented  with  triangular  membership  functions,  p(x),  as  shown  in  Fig.  7. 


Fig.  7 

The  width  of  the  membership  function  shows  the  range  of  possible  values.  The  number  in  Fig.  7 
can  be  interpreted  as  ranging  from  3  to  7  with  the  values  near  5  being  most  similar  to  5  or  better 
satisfying  the  membership  property:  “close  to  5",  than  those  farther  away  (Ref  11,  12).  At  this 
stage,  probably  it  will  be  desirable  to  relate  the  concept  of  the  Interval  of  Confidence  to  the  Level 
of  Presumption  with  a  simple  example. 

Suppose  a  certain  job  is  to  be  completed  between  two  dates,  say  June  10  and  June  30. 
This  is  an  interval  of  confidence.  On  the  other  hand,  let  us  assume  that  this  same  job  is  to  be 
completed  on  June  20,  a  possible  date.  The  interval  of  confidence  in  the  first  case  is  [June  10, 
June  30]  while  in  the  second  case  it  is  [June  20,  June  20].  If  we  wish,  we  may  assign  two  levels 
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of  confidence  to  these  two  situations,  0  for  [June  10,  June  30]  and  1  for  [June  20,  June  20]. 
These  two  levels  of  confidence  are  in  fact  Levels  of  Presumption,  and  we  can  represent  them  by 
[0,1].  Of  course,  there  is  no  reason  why  we  should  limit  ourselves  to  only  the  two  values  0  and  1. 
We  could  have  selected  a  much  larger  set  of  values  such  as: 

Va„  OjG  [0,1]: 

(a.  <  Oj)  -  ([»,”.  82“®)=  [a,“  a,'""]). 

This  means  that  if  a  increases,  the  interval  of  confidence  never  increases.  Fig.  8  illustrates 
graphically  the  mathematical  relation.  It  may  be  noted  that  the  different  values  of  a„  a2...etc.  are 
commonly  called  a  -  cuts  (Ref  10). 


Fig.  8  Definition  of  fiizzy  numbers 
The  Basic  Operations  on  Fuzzy  Numbers  [Ref  1 1,  12] 

Att  +  Ba  =  [a“.  a®]  +  [b“.  b®] 

=  (af +  bf.a?  +  b?l  =  [c?.c?] 

A„-B„  =  [af.a?]-[bf.b?l  (2) 

=  laf -b?.a?-bf]  =  (c?.c?l 
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(3a) 


For 


A„xB„=(af.a?]x(bf.b?l 

=  (min(af  bf ,  af  b® .  a®  b“ ,  a®  bf ). 

max(a®  b“ .  a®  b® .  a®  b“ ,  a®  b® )] 
=  (cf.c?] 

A„/B„={af.a?l/(bf.b?] 

=  (inin(af/bf .  af/bf .  aj/bf.  a?/bf ). 

max(af/bf .  af/b?.  a?/bf .  af/b?)l 
=  [cf.c?l 

1/B„  =  l/lbf.  b?] 

=  (min(l/b®.  l/b“).  max(l/b® .  1/b®)] 

=  (cf.c?l 


af .  a?,  b 


a 
I  * 


Aa 

Aa 


bJ.k^O 


±k  =  [af.a?]  +  k 

=  (af  ±k,a?±k]  =  [cf.c?) 

xk=(a®,a®lxk 

=  [inin(ka®,  ka®),  max(ka®.  ka®)] 

=  (cf.c?] 

we  get  the  following  simplifications; 


(5a) 

(6) 

(7) 


A„xB„  =  lafbr.a?b?] 
A„/B„=la?/b?.a?/bf] 
1/B„  =  [1/bf .  1/bf  ] 


(3b) 

(4b) 

(5b) 


Aa  X  k  =  [ka® ,  ka®  ]  (7b) 


The  operations  in  (4)  and  (5)  are  undefined  if  the  interval  B„  contains  0.  In  the  limit,  as 
bf  or  bf  goes  to  0  and  the  resulting  interval  becomes  infinite. 

Analysis  of  Series  System 

The  reliability  of  a  series  system  with  multiple  components  was  discussed  in  detail  in  the 
previous  sections  (Fig.  1).  In  this  type  of  system  all  components  must  be  operational  and  for  a 
system  with  n  component,  the  reliability  is  given  by: 

=  Ri  X  X X  R„  (8) 

In  terms  of  the  failure  probabilities,  it  can  be  expressed  as 

Q^=i-[(i-Qi)(i-Q2) . (1+QJ]  (9) 
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In  this  analysis,  we  have  used  two  fixed,  electrol5^ic,  tantalum  capacitors,  identified  as  Part  1 
(Data  Source;  13567-021 -commercial)  and  Part  2  (Data  Source;  14851-000-military)  (see  Table  1). 
For  a  two-component  series  systems  equations  8  and  9  can  be  written  as: 


R^  =  R,xRj 

(8a) 

Q.,.=  1-[(1-Q,)(1-Q2)] 

(9a) 

Although  the  equations  are  precise,  the  statistical  data  used  (see  Appendix)  for  computation  are 
not.  Hence,  applying  fuzzy  logic,  we  can  treat  the  reliability  and  the  probability  of  failure  as  fiizzy 


□umbers.  Hence,  ft.,,  =  fti  x  R, 

(10) 

Q.,.=  1-[(1.Q,){1-Q,)] 

(11) 

where  the  intervals  are  as  follows; 


^2  [*^2b  ^22] 

[^tb  *”12] 

[q.b  qJ 

where  r,i  =  ri,  x  rj, 

r.2=  ri2X  r22 

q,i=  (1-  O 

q.2=  0-  r.i) 

The  results  of  this  analysis  are  shown  in  Table  4  and  Fig.  9. 


Table  4 


iT^b  Component  Seri 

Fuzzy  Reliability  =  =  Part  1  (see  Table  1) 

Fuzzy  Reliability  =  ft2  =  Part  2 

ft. 

fii 

fn 

ft2 

^21 

^22 

r.i 

r.i 

0%/ 

q,i 

^82 

0.9947 

0.99 

0.995 

0.9938 

0.99 

0.995 

0.9885 

0.9801 

0.99 

0.0115 

0.01 

0.0199 
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Fig.  9  Reliability  and  Failure  Probability  of  a  Two-component  Series  System 
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The  reliability  of  a  multi-unit  parallel  system  was  also  discussed  in  detail  in  previous 
sections  (Fig.  4).  Here  the  system  works  as  long  as  at  least  one  component  is  operational.  The 


system  failure  probability  can  be  written  similarly  as; 


Q^=Q,xQ2X xQ„  (12) 

This  equation  can  be  rewritten  for  a  two  component  -  one  required  (Fig.  2)  system  as: 

Q.„-Q,xQ2  (13) 


Agmn,  although  the  equation  is  precise,  the  statistical  data  used  for  the  computation  are  not. 
Hence,  considering  the  failure  probabilities  as  fiizzy  numbers,  we  can  write 

Qi  -  Q. 

Q2  ^  Q2 

(14) 

and  hence  ~  ~~ 

Q.y,  =  Q.xQ2  (15) 

where  the  intervals  are: 

Qi  [qii.  qn] 

Qi  [q2i>  q22] 

[q,i.  q,2]  (16) 

The  system  interval  [q,,,  q,2]  is  given  by 

q»i  “  qib  q2i 

qi2  “  qi2>  q22 
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(17) 


The  results  of  this  analysis  are  shown  in  Table  5  and  Fig.  10. 

e5 


Component  Parallel  System 


.  .  ^  ru 

Fuzzy  Probability  =  Qj  =  Part  1  (lee  Table  i)  Fuzzy  Probability  =  Q2  =  Part  2 


r>J 

Qi 

qn 

qi2 

Qa 

q2i 

q22 

q.1 

q.2 

0.0053 

0.00265 

0.00795 

0.0062 

0.0031 

0.0093 

3.286 

xlO-’ 

0.8215 

xlO'* 

7.394 

xlO-* 

Fig.  10  Failure  Probability  of  a  Two  Component  Parallel  System 
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It  should  be  noted  that  though  the  results  of  this  analysis  are  presented  in  the  triangular 
membership  function,  p(x),  in  practice,  the  membership  functions  could  be  a  smooth  curve  similar 
to  that  shown  in  Fig.  8.  Also,  in  the  present  work,  the  concept  of  a  fuzzy  number  is  presented 
using  the  couple:  level  of  presumption  and  interval  of  confidence.  This  is  not,  however,  the 
classical  way  of  introducing  a  fuzzy  number.  The  classical  technique  was  not  considered  in  the 
present  work;  however,  the  details  of  this  technique  are  readily  available  in  Ref  10. 

Conclusion 

A  simulation  model  for  determining  the  precision  of  reliability  was  developed  for  a 
combination  (series  and  parallel)  system  using  the  method  of  bounds.  The  model  was  validated 
v(dth  available  field  data.  It  is  well  known  that  field  data  inherently  carry  a  certain  amount  of 
uncertainty.  The  model  was  developed  further  using  the  fiizzy  number  techniques  to  account  for 
the  uncertainty  of  the  field  data. 

Acknowledgments 

The  author  would  like  to  extend  his  sincere  thanks  to  Mr.  Joseph  A.  Caroli  for  his  support 
towards  this  project  as  the  focal  point  at  the  Air  Force  Research  Laboratory,  Rome,  NY.  Thanks 
are  also  due  to  Mr.  William  Denson  of  Reliability  Analysis  Center  (RAC),  Rome,  NY,  for 
supplying  the  field  data  for  this  project  and  Ms.  Eileen  D’Agostino  for  the  care  she  took  in  typing 
this  report. 


8-31 


REFERENCES 


1.  Denson,  William  and  Keene,  Samuel,  “New  System  Reliability  Assessment  Methods.” 
RAC  Project  #A06830,  January  1996. 

2.  Denson,  William  and  Keene,  Samuel,  “A  New  System  Reliability  Assessment  Method.” 
Reliability  Review,  Vol.  15,  p.  16-20,  December  1995. 

3.  Das,  D.  K.  “Techniques  for  Determining  the  Precision  of  Reliability  Predictions  and 
Assessments”.  SFRP  Final  Report,  August  1996. 

4.  Amstadter,  B.  L.,  “Reliability  Mathematics;  Fundamentals,  Practices,  Procedures.” 
McGraw  Hill  Book  Company. 

5.  Zadeh,  L.A.,  “Fuzzy  Sets.”  Information  and  Control,  Vol.  8,  pg.  338-353,  1965. 

6.  Zadeh,  L.  A.,  “Fuzzy  Sets  as  a  Basis  for  a  Theory  of  Possibility.”  Fuzzy  Sets  and 
Systems,  Vol.  1,  No.  1,  p.  3-28,  1978. 

7.  Misra,  K.  B.  (editor),  “New  Trends  in  System  Reliability  Evaluation.”  Elsevier  Science 
Publisher,  1993. 

8.  Dempster,  A.  P.,  “Upper  and  Lower  Probabilities  Induced  by  a  Multi-Valued  Mapping.” 
Ann.  Math.  Statist.,  Vol.  38,  p.  325-339,  1967. 

9.  Shafer,  G.,  “A  Mathematical  Theory  of  Evidence.”  Princeton  University  Press,  Princeton, 
New  Jersey,  1976. 

10.  Kaufmann,  A.  and  Gupta,  M.  M.,  “Introduction  to  Fuzzy  Arithmetic:  Theory  and 
Applications.”  Van  Nostrand  Reinhold,  New  York,  1991. 

1 1 .  Bowles,  J.  B.,  “Applying  Fuzzy  Logic  to  the  Design  Process.”  Annual  Reliability  and 
Maintainability  Symposium.  Las  Vegas,  Nevada,  January,  1996. 

12.  Bowles,  J.  B.  and  Palaez,  C.  E.,  “Application  of  Fuzzy  Logic  to  Reliability  Engineering.” 
Proc.  IEEE,  Vol.  83,  No.  3,  p.  435-449,  March  1995. 


8-32 


APPENDIX 


Reliability  Data  for  Capacitors 
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tKKU-»/ 


Part  Details  3-121 


Parc  Quallcy  App  Data  Part 

D«sc.  Level  Env  Source  CharaccerlsClcs 


Capacitor. Fixed. Electrolytic. Tantalum 

Coomercial  GB  13S67-021>PI >T110D476K035AS.M£riVarious, Capacitance  Vax47.000u. 

Voltagre  Rating  s3S  .OOOv,  Pc^i443e4 
-Pf {T110C106H0S0AS,Mfri Various. Capacitance  Va 1 10 .OOOu. 

Voltage  Rating :50 .OOOv. Popi 12392 

-UPI  10180-3812 .Ktr t Various. Capacitance  Va : IS .OOOu, Voltage  Rating 1 20 .OOOv, 
Pop:100344 

-UPl >0180-3813 .KfnVarious. Capacitance  Va 1 10 . OOOu, Voltage  Rating] 10 .OOOv, 
Pop]3l072 

-UPl  1 0180-3616, Mfr :Various. Capacitance  Vai4 .700u, Voltage  Rating] 25 .OOOv. 

Popt  344 

-UPl >0180-3822 , Ktr > Various. Capacitance  Va > 39 .OOOu. Voltage  Rating > IS .OOOv, 
Pop>3436 

-UPl >0180-3827 ,Mfr> Various, Capacitance  Va > 3 . 3 OOu, Voltage  Ratingi2S .OOOv, 
Popi79620 

-UPl >0180-3831, MfriVarious. Capacitance  Va > 10 . OOOu, Voltage  Rating] 3S. OOOv, 
Pop>323S52 

-UPl >0180-38 33, M£r> Various. Capacitance  Va >22 .OOOu, Voltage  Rating >10. OOOv, 
Pop>60S6 

-UPl >0180-3834 ,KCr> Various. Capacitance  Va: 33 .OOOu, Voltage  Rating] 10 .OOOv, 
Pop]Bl€48 

-UPl >0180 -3 841. Ktr > Various, Capacitance  Va >4 .700u, Voltage  Rating 1 16 .OOOv, 

Pop>260 

-UPl  >0180-3644, Ktr > Various. Capacitance  Va>3 .300u, Voltage  Rating > 35 .OOOv, 
Pop>20960 

-UPl 1 0180-3 845, Mtr> Various. Capacitance  Va >4 .700u, Voltage  Racing i 35 .OOOv, 
Pop>38272 

-UP#  >0180-3846. Ktr] Various, Capacitance  Vai 15. OOOu, Voltage  Rating >25. OOOv. 
Pop>19264 

-UPl >0180-3847, Ktr > Various. Capacitance  Va >22 .OOOu, Voltage  Racing >25. OOOv. 
Pop>31600 

-UPl >0180-3848 .Ktr iVarlous, Capacitance  Va> 33 .OOOu. Voltage  Rating: 16 .OOOv, 
Pop>44526 

-UPl ) 0 180 -3 84 9. Kfri Various, Capacitance  Va> 47. OOOu, Voltage  Rating: 10. OOOv. 
Pop>26440 

-UPl >01 80-3850. Ktr ) Various, Capacitance  Vax 68 .OOOu, Voltage  Rating: 10 .OOOv, 

Pop: 36500 

-PI > 195O224X0035S3 .Ktr > 3K  Co. .Capacitance  Va> .220u. Voltage  Rating >35. OOOv. 
Pop>2472 

-PI  >T356H226H025AS.Ktr:Keinet  Electronics  Corp.  .Capacitance  Va>22.000u, 

Voltage  Rating>25 .OOOv, Pop:31796 

-UPl >0180-3887. Ktr xVarious, Capacitance  Va > 15 .OOOu, Voltage  Rating >20 .OOOv. 

Popt44 

-UPl >0180-3888, Ktr>Various, Capacitance  Va>15. OOOu. Voltage  Rating>25.000v, 
Pop>5184 

-UP#  >0160-3889. Ktr iVarious. Capacitance  Va  >  6 .600u, Voltage  Rating; 35 .OOOv. 
Pop>2592 

-UPl  >0180-3922 .Ktr  > Various. Capacitance  Va> IS .OOOu. Voltage  Rating  >20 .OOOv, 

Pop>  86968 

-PI >T396J107K010A3.KtrtKeinet  Electronics  Corp. .Capacitance  Va>100.000u. 

Voltage  Rating: 10 . OOOv, Pop>9824 

-PI )195D105X0035S3.Ktr:3H  Co. .Capacitance  Va : 1 .OOOu, Voltage  Rating>35.000v, 

Pep]  824 

-UPl i0180-3965.Kfr iVarlous, Capacitance  Va > 47 .OOOu. Voltage  Ratlng>6 .300v. 
Pop>llB8 

-PI iT398E106K025AS.Kf rxXemet  Electronics  Corp. .Capacitance  Va: 10. OOOu. 

Voltage  Rating>25 .OOOv. Popi6e000 

-UPl > 0180-4015. Ktr > Various. Capacitance  Va >22 .OOOu, Voltage  Rating: 16 .OOOv, 
Pop>13668 

-PI >T398C47SK050AS.Ktr >Kemet  Electronics  Corp. .Capacitance  Va:4.700u. 
voltage  Rating >50 .OOOv, Pop]20048 

-UPl >0180-4031 , Ktr > Various. Capacitance  Va >10 .OOOu, Voltage  Rating >35 .OOOv. 
Pop>232 

-PI >T398B105K050XS.Ktr iXemet  Electronics  Corp. .Capacitance  Vaxl.OOOu, 
voltage  Rating :  50 .  OOOv,  Pop  >260  ^ 

-UPl > 0180-0022. Ktr >Varlou5, Capacitance  Va ; 3 . 900u, Voltage  Rating: 35 .OOOv. 

Popi9216  ^ 

-UPl > 018 0-0097, Ktr> Various, Capacitance  Va > 47 .OOOu, Voltage  Rating; 35 .OOOv. 

-UM 1 0180-0098 , Ktr iVsrious. Capacitance  Va > 100 .OOOu, Voltage  Ratingi20 . OOOv, 

-UM >0180-0100 , Ktr > Various. Capacitance  Va : 4 .700u, Voltage  Rating; 35 . OOOv, 
pop>863596  ,  ^  ^  ,  ««« 

-UP! > 0180-0106, Ktr > Various. Capacitance  Va ; 60 .OOOu, Voltage  Rating; 6 . OOOv, 
PoP«224328 

-UP# >0180-0113, Ktr iVarlous. Style :Wet  slug. Capacitance  Va> 100. OOOu. 
voltage  Ratlng:30, OOOv, Pop:5044 

-UPl : 0180-0116 , Ktr ; Various. Capacitance  Va :6 . 800u, Voltage  Rating: 35 .OOOv, 
Pop:2675700 

-UP# :0180-0117. Ktr : Various. Capacitance  Va >2 .700u, Voltage  Rating: 35 .OOOv, 

-UP# :0180-0137 , Ktr iVarious . Capacitance  va : 100 . OOOu. Voltage  Rating : 10 .OOOv, 

Pop : 3  0  5  8  0 

-UPl : 0180-0155. Ktr : Various , Capacitance  Va ; 2 .2 OOu. Voltage  Rating: 20 .OOOv, 

Pop;205212  ,  . „  „„„ 

-UPl : 0180-0159 .MCr ; Various . Capacitance  Va ;220 , OOOu. Voltage  Racing : 10 . OOOv, 

Pop;84164  ^ 

-UPl : 01 80 -0160, MCr iVarlous , Capacitance  Va : 22 . OOOu. Voltage  Racing : 35 -OOOv, 
Pop:80524  ^ 

-UPl ; 0180-0161, Mfr : Various . Capacitance  Va : 3 . 300u, Voltage  Racing : 35 .OOOv. 

Pop:79016  ^ 

-UPl : 0180-0194 .Mfr iVarlous .Capacitance  Va : 150 -OOOu, Voltage  Racing: 15 .OOOv, 
Pop:8416 

-UPl : 0180-0195 , MCr iVarlous . Capacitance  Va : . 330u. Voltage  Rating: 35 . OOOv. 

-UM : 0180-0196, Mfr iVarlous, Capacitance  Va :56 .OOOu. Voltage  Rating : 15 .OOOv. 

-UP^: 0180-0197 .Mfr iVarlous . Capacitance  Va :2 .200u, voltage  Rating; 20 .OOOv, 

-UP^ ; 0180-0210. Mfr iVarlous . Capacitance  Va : 3 . 300u. Voltage  Rating: 15 .OOOv. 

-UPl : 0180-0216 .Mfr iVarlous .Capacitance  Va : 12 . OOOu. Voltage  Rating ; 35 . OOOv. 
Pop:792  _ _ 

Reliability  Analysis  Center  (RAC)  •  201  Mill  St.,  Rome,  NY  13440-6916 


Pail /Hours 
or  Miles  (E6) 


4/1122.6748 


16/3478.4100 


4/4211.8648 


315-337-0900 


EPRD-97 


Part  Details  3-117 


Part  Quality  App  Data  Part  Pall /Hours 

Dasc.  Laval  Cnv  Sourca  Characteristics  or  Milas  (E€) 


Capacitor. Fixed*  Elactrolytic. Tantalum 

Comaarclal  CB  13567-021-UPI  i0180>0630.Kfr :Varlous, Capacitance  Va : 4 .700u. Voltage  RatlngtSO.OOOv. 
Pop: 100920 

-UPl  i 0160-0642. Mfri Various, Capacitance  Va  t IS . OOOu, Voltage  Rating : 20 .OOOv, 
Popil2644 

-UPt  >0180-0648,Mfr : Various. Capacitance  Va : . lOOu, Voltage  Racing : 35 .OOOv, 
Popai424 

-UPl  s 0180 -067 8. Mfr i Various. Capacitance  Va : 220 . OOOu, Voltage  Racing: 3 .OOOv. 
Pop:127l2 

-PI :KZC-015-825R-10.Kfr:AVX  Corp .. Capacitance  Va:6.200u, 

Voltage  Rating: IS .OOOv, Pop: 12712 

-PI  :K2A-006-685-R20.Mtr*AVX  Corp  . ,  Cap>aci  tance  Va:6.800u, 

Voltage  Rating: 6. OOOv, Pop: 108 

-UPl  1 0180-0690, Kfr : Various, Capacitance  Va i2 .200u, Voltage  Rating: 20 .OOOv, 
PopiS 

-UPl  1 0180-1701. Mfn Various, Capacitance  Va : 6. 800u, Voltage  Rating: 6. OOOv. 
Pop:189440 

-UPl  X 0180-1702 , Kfr tVarlous. Capacitance  Va : 180 .OOOu. Voltage  Ratlngx6 .OOOv, 
Popi8472 

-UPl  1 0180-1704. Kfr xVarlous. Capacitance  Va:47 . OOOu. Voltage  Ratlngi6 .OOOv, 
Poptl25896 

-UPl  t0180-l706,Kf nVarlous, Style xWet  slug, Capacitance  Va 1 100 .OOOu, 
voltage  Ratlngt2S.000v.Pop:22276 

-UPl  1 0180-1713 , Kfr xVarlous, Capacitance  Va : .S60u, Voltage  Rating: 35 .OOOv. 
Pop:3112 

-UPl  1 0180-1714. Kfr xVarlous, Capacitance  Va: 330 .OOOu, Voltage  Rating : 6 .OOOv, 
Popt60444 

-UPl : 0180- 1715, Kfr  xVarlous, Capacitance  Va 1 150 .OOOu, Voltage  Rating: 6 .OOOv, 
Popt3064e 

-UPl  x0180-1718.Kfr  xVarlous, Capacitance  Va : 56 .OOOu, Voltage  Rating: 20 .OOOv. 
Pop:3112 

-UPl  t0160-1731.Mfr xVarlous, Capacitance  Va :4 . 7 OOu. Voltage  Rating: 50 . OOOv, 
Popi3914S2 

-UPl  10180-1735, Kfr iVarlous, Capacitance  Va: .220u. Voltage  Rating: 35 .OOOv. 
Popi223108 

-UPl  x0180-1743.Kfr xVarlous, Capacitance  Va : . lOOu. Voltage  Ratlngi35 . OOOv, 
Pop:150736 

-UPl  X 01 80- 1745. Kfr 1 Various, Capacitance  Va : 1 . SOOu, Voltage  Rating: 20 .OOOv. 
Popil40872 

-UPl  1 0180 -1746. Kfr : Various. Capacitance  Va : 15 .OOOu, Voltage  Rating: 20 .OOOv, 
Popi2551404 

-UPl  1 0180-1747 , Kfr tVarlous. Capacitance  Va : 150 . OOOu. Voltage  Rating : 15 . OOOv, 
Pop:14856 

-UPl  1 0180-1759, Kfr xVarlous, Capacitance  Va i .056u, Voltage  Rating: 35 .OOOv, 
Pop: 100 

-UPl  t0160-1760.Mf r  tVarlous, Capacitance  Va : . 120u, Voltage  Rating : 35 . OOOv, 
Pop: 100 

-UPl  1 0180-17 61, Kfr  tVarlous, Capacitance  Va : .560u. Voltage  Rating: 35 .OOOv, 
Popt39624 

-UPl  i0180-1775,Kfr xVarlous, Capacitance  Va : 1 . 800u. Voltage  Rating: 35 .OOOv, 
Poptl316 

-UPl  1 0180 -1779, Kfr: Various, Capacitance  Va : 18 . OOOu. Voltage  Rating: 35 .OOOv, 
Pop:lS264 

-UPl t0180-1794 .Kfr tVarlous. Capacitance  Va :22 .OOOu, Voltage  Rating: 35 .OOOv. 
Popx228388 

-UPl  i0180-1615,Kfr xVarlous, Capacitance  Va :7 . SOOu. Voltage  Rating: 20 .OOOv, 
Popt4e48 

-UPl :0180-1834 , Kfr  tVarlous. Capacitance  Va : 15 . OOOu. Voltage  RatlngxSO .OOOv, 
Popx28024 

-UPl  10180-1835. Kfr tVarlous, Capacitance  Va : 68 .OOOu, Voltage  Rating: 15 .OOOv. 
Popt24172 

-UPl X 0180-18 46 .Kfr: Various, Capacitance  Va :2 .200u, Voltage  Rating: 35 .OOOv. 
Poptll9740 

-UPl X 0180- 1661, Kfr tVarlous. Capacitance  Va : 27 .OOOu. Voltage  Rating: 10 .OOOv, 
Popx53184 

-UPl  1 0160 -194 0 , Kfr : Various. Style :Wet  slug, Capacitance  Va: 33 .OOOu, 

Voltage  Rating: 10 .OOOv. Pop: 3752 

-UPl : 0180-1954 , Kfr : Various , Capacitance  Va :4 . 700u. Voltage  Rating : 6 . OOOv, 
Pop:7072 

-UPl  1 0180 -1974 .Kfr : Various, Capacitance  Va : 10 . OOOu, Voltage  Rating ; 35 . OOOv, 
Pop:442e0 

-UPl : 0180-1980. Kfr :Various . Capacitance  Va : 1 . OOOu, Voltage  Rating : 35 . OOOv, 
Pop:29388 

-UPl  t0180-2050.Kfr : Various. Capacitance  Va : . 08 2u. Voltage  Rating:35.000v, 
Pop:5860 

-UPl : 0180-2060, Kfr  tVarlous, Capacitance  Va : 40 . OOOu, Voltage  RatingxlO.OOOv, 
Pop: 19252 

-UPl  :0ie0-2062 , Kfr : Various . Capacitance  Va : 120 . OOOu. voltage  Rating : 10 .OOOv. 
Pop: 500 4 

-UPl  :0180-2071 , Kfr tVarlous. Capacitance  Va : . 022u, voltage  Rating : 35 . OOOv, 
Pop:9280 

-UPl : 0180 -2076 , Hf r : Various. Capacitance  Va : . 06 8u, Voltage  Rating : 35.000V. 
Pop: 36 

-UPl : 0180-2079. Kfr tVarlous, Capacitance  Va ; . 390u. Voltage  Rating ; 35 . OOOv. 
Pop:592 

-UPl : 0180-2 104 . Kf r : Various. Style : Wet  Slug. Capacitance  Va ; 40 . OOOu, 

Voltage  Rating : 30 . OOOv, Pop : 3488 

-UPl s 0180-2111, Kfr  tVarlous, Capacitance  Va : 33 . OOOu. Voltage  Racing:35.000v, 
Pop: 7200 


0/131.1960 

0/16.4372 

0/14.8512 

0/16.5256 

0/16.5256 

0/0.1404 

0/0.0104 

0/246.2720 

0/11.0136 

0/163.6648 

16/28.9588 

0/4.0456 

0/78.5772 

0/39.8424 

0/4.0456 

8/508.8876 

4/290.0404 

0/195.9568 

0/183,1336 

36/3316.8252 

0/19.3128 

0/0.1300 

0/0.1300 

0/51.5112 

0/1.7108 

0/19.8432 

0/296.9044 

0/6.3024 

0/36.4312 

0/31.4236 

0/155.6620 

0/69.1392 

0/4.8776 

0/9.1936 

0/57.5640 

0/38.2044 

0/7.6180 

0/25.0276 

0/6.5052 

0/12.0640 

0/0.0468 

0/0,7696 

0/4.5344 
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3-116  Part  Details 


EPRD-97 


Parc  Qua 1 Icy  App  Data  Parc 

Dasc.  Laval  Env  Sourc®  Characcariscics 

Capacitor . rixad. Elaccrolytlc, Tantalum 

Coomarcial  CB  13567-021-UPI  i OlBO-0218 . Kir s Various , Capacitance  Va 
Pop:39004 

-DPI j0180-0228.Kfr : Various. Capacitance  Va 
Pop!2047428 

-UP! !0180-0229.Kfr: Various, Capacitance  Va 
Pop: 1930224 

-UPl 50180-0230. Hfr : Various , Capacitance  Va 
Pop:574664 

-UPl ! 0180-0232, Mfr: Various, Style: Wet  Slug 
Voltage  Rating : 100 . OOOv, pop : 388 

-UPl 1OI8O-O234. Min Various. Style; Wet  Slug 
Voltage  Rat ingi 75 . OOOv. Pop ; 16256 

-UPl : 0180-0235, Mf r 1 Various . Style : Wet  Slug 
Voltage  Rating : 75 . OOOv, Pop : 2748 

-UPl  1 0180-0291. KCr I  Various, Capacitance  Va 
Popi4220396 

-UPl i0180-0309,Mfr  5 Various, Capacitance  Va 
Popi661456 

-UPl »0180-0347, KCr sVarlous. Capacl tance  Va 
Popil2340 

-UPl i0180-0349,Kfr : Various , Capac It ance  Va 
Popi7332 

-UPl i0180-0354,MfriVarious. Capacitance  Va 
Pop«5940 

-PI  1 151D345X0035X2 ,Ktr 1 3H  Co. .Capacitance 
Popi2456 

-UPl 5 0180-0356 .Kfr I Various , Style : Wet  Slug 
Voltage  Rating : 15 . OOOv, Pop : 448 

-UPl f 0180-0373. KCr 5 Various. Capacitance  Va 
Popj205040 

-UPl i0180-0374.HCr 5 Various, Capacitance  Va 
Popi2314796 

-UPl i0180-0375,KCri Various, Capacitance  Va 
Pop>39308 

-UPl i0180-0376,HCr5 Various, Capacitance  Va 
Popi269584 

-UPl i0180-0387,Kfri Various, Capacitance  Va 
Popi 4180 

-UPl i0180-0388.KCri Various. Style: Wet  Slug 
Voltage  Rating : 20 , OOOv. Pop : 3 8 9 6 

-UPl t0180-0393, KCr: Various, Capacitance  Va 
Popt64564 

-UPl : 0180-0405, KCr: Various, Capacitance  Va 


150u, Voltage  Rat Ing : 35 . OOOv, 
;22.000u. Voltage  Ra ting :15. OOOv, 

: 33 . OOOu. Vol tage  Rat Ing : 10 . OOOv, 

: 1 . OOOu . Vol tagc  Rat Ing : 50 . OOOv . 
.Capacitance  Va:10.000u. 
.Capacitance  Va:33,OOOu, 
.Capacitance  Va;56.000u. 

5 1 . OOOu. Vol cage  Rat Ing : 3 5 . OOOv. 

: 4 . 700u, Voltage  Rat ing : 10 . OOOv. 

1 1 . 500u , Voltage  Rating  535. OOOv. 

5 . 820u , Voltage  Rat ing ; 35 . OOOv. 

: 40 . OOOu, Voltage  Racing t 10 . OOOv, 

Va 5 3 . 400u. Voltage  Rat ing t 3 5 . OOOv . 
.Capacitance  Va:70.000u. 

680u. Vol tage  Racing : 35 . OOOv . 

:  10 . OOOu , Voltage  Ra t Ing : 20 . OOOv . 
;68. OOOu. Vol tage  Rating  520. OOOv, 

5 . 470u, Voltage  Rating  5 35 , OOOv. 

;47. OOOu. Vol tage  Ratlng:20.000v, 
.Capacitance  Va:56.000u. 

:39. OOOu. Voltage  Rating : 10 . OOOv. 

:  1 . 800u . Vol tage  Ra t Ing : 2 0 . OOOv, 


Pop:49976 

-UPIi0180-0407,KCr 
Voltage  Ratlng530 
-UPl 10180-0415. KCr 
Pop  5  3560 

-UPl 50180-0418, KCr 
POP589824 

-UPl 50180-0428. KCr 
Pop5l5012 

-UPl 50180-0437. KCr 
Pop563144 

-UPl 10180-0451. Kfr 
Pop541340 

-PI IT376C105K020AS 
Voltage  Ratingt20 
-UPl 50180-0474. Kfr 
Pop5l503l6 
-UPl i01B0-0479.Hfr 
Pop:9792 

-UPl s01B0-0481.Kfr 
Voltage  Rating:20 
-UPl 10180-0486. Hfr 
POP52372 

-UPl  50180-0490. Kfr 
Popt36832 

-UPl  50180-0491  .Hfr 
Pop  5  686180 
-UPl 50180-0500. Kfr 
Pop:49760 

-UPl 50180-0548, Kfr 
Popi  4 

-UPl 50180-0552, Kfr 
Pop;41956 

-UPl :0180-0553.MCr 
Pop;179028 
-UPl s01B0-0558,Mfr 
Pop:4364 

-UPl 50180-0562, Hfr 
Pop  5  183348 
-UPl :0180-0575,Kfr 
Pop: 1800 

-UPl 50180-0594 , Mfr 
POP563276 

-UPl :0180-0597.Mfr 
Pop  5  2 1 8  0 


:Var loui , Style  5  Wet  Slug . Capac 1 tance  VailBO.OOOu. 

.OOOv. Pop: 12208 

sVarious, Capacitance  Va : 1 0 . OOOu . Vol cage  Rating : 25  .  OOOv. 

iVarious, Capacitance  Va : 1 . OOOu . Voltage  Rating : 35 . OOOv, 

5 Various, Capacitance  Va : 68 . OOOu . Vol cage  Rat Ing : 6 . OOOv. 

5 Various, Capacitance  Va s 3 9 . OOOu , Vol cage  Rat Ing : 1 0 . OOOv. 

5Varlous. Capacitance  Va 1 3 . 300u . Vol cage  Ra c Ing : 20 . OOOv. 

.Hfr:X«met  Electronics  Corp. .Capacitance  Va:1.000u, 

.  OOOv. Pop : 8 

5Varlous, Capacitance  Va 5 15 . OOOu . Vol cage  Ra t ing ; 2 0  .  OOOv . 

I  Various , Capacitance  Va i 47 , OOOu, Voltage  Rat ing : 2 5 . OOOv , 

5  Various. Style  5  Wet  Slug , Capac 1 tance  Va:100.000u, 

.OOOv, Pop: 3016 

5 Various, Capacitance  Va s 1 0 . OOOu , Vol cage  Rat Ing : 2 0 . OOOv . 

5 Various . Capacitance  Va 5 68 . OOOu. Voltage  Rat Ing : 6 . OOOv , 

5 Various . Capacitance  Va : 1 0 . OOOu , Vol tage  Ra c 1 ng ; 2 5 . OOOv , 

: Various . Capacl tance  Va : 4 7 . OOOu . Vo  1 tage  Rat Ing : 2 0 . OOOv , 

: Various , Capacl tance  Va ; 56 . OOOu , vol tag©  Rat Ing : 6 . OOOv, 

; Various , Capac Itance  Va : 2 2 0 . OOOu , Vol tage  Rating : 10 . OOOv. 
:  Various , Capac Itance  Va : 22 . OOOu , Voltage  Ra 1 1 ng : 2 5 . OOOv , 

; Various , Capacitance  Va : 4 70 . OOOu , Vo  1 tage  Ra t i ng : 1 0 . OOOv , 
;Varlous . Capacl tance  Va : 3 3 . OOOu . Vo  1 tage  Rat  1 ng : 1 0 . OOOv, 
:Various , Capacl tance  Va ; 2 . 2 OOu . Vo  1 tage  Ra t Ing : 1 5 . OOOv , 

:  Various. Capacitance  Va : 3 . 3 OOu . Vo  I tage  Rat Ing : 1 5 . OOOv, 

: Various , Capacl tance  Va ; 2 2 . OOOu , Vo  1 tage  Ra 1 1 ng : 5 0 . OOOv , 


Eail/Kours 
or  Miles  (E6) 

0/50.7052 

8/2661.6564 

4/2509.2912 

12/747.0632 

0/0.5044 

0/21.1328 

0/3.5724 

16/5486.5148 

0/859.8928 

0/16.0420 

0/9.5316 

0/7.7220 

0/3.1928 

0/0.5824 

0/266.5520 

20/3009.2348 

0/51.1004 

0/350.4592 

0/5.4340 

0/5.0648 

0/83.9332 

0/64.9688 

0/15 . 8704 

0/4.6280 

4/116.7712 

0/19.5156 

0/82.0872 

0/53.7420 

0/0.0104 

0/195.4108 

0/12.7296 

0/3.9208 

0/3.0836 

0/47.8816 

0/892.0340 

0/64 .6880 

0/0.0052 

0/54.5428 

4/232.7364 

0/5.6732 

0/238.3524 

0/2.3400 

0/82.2588 

0/2.8340 
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3-118  Part  Details 


EPRD-97 


P*rt  Quality 
Desc .  L«v*l 


App  Data 
Env  Source 


part 

Characteristics 


Fail /Hours 
or  Miles  (E6) 


Capacitor, Fixed, Electrolytic, Tantalum 

Cooinercial  CB  13S67>021-UPI i0180'212S,K£riVarlous, Capacitance  Va t IS . OOOu. Voltage  Rating :20.000v, 
Popi2248 

-UPf  <0180 -2 126, Kir  t Various. Capacitance  Va 1 1 .SOOu. Voltage  Rating : 35 . OOOv. 
Pop<8976 

-UPl  i0180*2127,M£r : Various, Capacitance  Va : . ISOu, Voltage  Racing : 35 , OOOv, 

Pop  1 9244 

-UPl  <0180-2129 ,K£r iVarious, Capacitance  Va  < 10 . OOOu. Voltage  Racing : 50 . OOOv, 

Pop <19704 

-PI <109D106X0060C2-DYP.Mtr: Sprague  Electric  Co. .StyleiWet  Slug. 

Capacitance  Va< 10 .OOOu, Voltage  Rating: 60 . OOOv. Popt 47312 
-UPl <0180-2140,MfriVarlous, Capacitance  Va :5 .600u. Voltage  Rating :50 . OOOv, 

Pop< 12720 

-UPl < 0180-2141, Ml r< Various, Capacitance  Va < 3 .300u. Voltage  Racing <50 . OOOv. 
Pop<1589S2 

-UPl <0180-2148. HfrtVarious, Capacitance  Va< . 470u, Voltage  Rating <50. OOOv. 
Pop<6368 

-UPl <0180-2161. K£r<Various. Capacitance  Va < .750u. Voltage  Rating <50 .OOOv, 
Popil6040 

-UPl 1 0160-2178, K£r:Various. Style iWet  Slug. capacitance  Va <220. OOOu. 

Voltage  Rating <8. OOOv, Pop <3220 

-UPl  t01B0-2182.K£r<Varlou8, Capacitance  Va <18 .OOOu. Voltage  Racing <50 .OOOv. 
P(^<14268 

-UPl 1 0180-2166, MfrtVarious, Style <Wec  Slug. Capacitance  Va: 300. OOOu. 

Voltage  Rating<30 .OOOv, P<^<9200 

-UPl <0180-2192, K£r<Varlous, Capacitance  Va <6 .600u, Voltage  Rating 1 50 .OOOv, 
Pop<2856 

-UPl  <0180-2195, M£r< Various. Capacitance  Va: IS. OOOu. Voltage  Rating <35. OOOv. 
Popi27916 

-UPl  <0180-22 01, H£r< Various, Capacitance  Va< .680u, Voltage  Rating :75 .OOOv. Pop <8 
-UPl < 0160-2204, K£r< Various, Capacitance  Va: 10. OOOu, Voltage  Rating > 10 . OOOv. 
Pop<1820 

-UPl  <0180 -2205, M£ri Various, Capacitance  Vat . 3 30u, Voltage  Rating: 35. OOOv. 
Pop<180516 

-UPI<TA  ELECTROLimCB.M£r< Various, Capacitance  Vat 60. OOOu, 

Voltage  Ratlngt6.000v,Popil84340 

-UPl  <0180 -2207, K£r< Various. Capacitance  Va < 100 . OOOu, Voltage  Racing: 10. OOOv, 
Pop<286280 

-UPl <0180-2208, MfrtVarious, Capacitance  Vat220. OOOu. Voltage  Rating <10. OOOv. 
Pop<161604 

-UPl <0180-2247, MfrtVarious. Capacitance  Va : 10 . OOOu. Voltage  Rating: 20 .OOOv, 
Popt6236 

-UPl <0160-2249. MfrtVarious, Capacitance  Va:47. OOOu. Voltage  Rating <20. OOOv. 
PoptlS8224 

-UPl <01 80 -2255, MfrtVarious, Capacitance  Va <2 .200u, Voltage  Rating: 20 . OOOv, 
Pop<S3192 

-UPl <0180 -2264. MfrtVarious, Capacitance  Va : 3 . 300u. Voltage  Rating: 15 . OOOv. 
Pop<58676 

-UPl <0180-22 66. MfrtVarious, Style: Wet  Slug, Capacitance  Va < 140 .OOOu. 

Voltage  RatingiSO. OOOv, Pop: 1832 

-UPl 1 0180-2275. MfrtVarious, Style tWec  Slug. Capacitance  Va <220 .OOOu. 

Voltage  Racingi8.000v.Pop<16936 

-UPl <0180-2333 .Mfr tVarious, StyletWeC  Slug. Capacitance  Vat 330. OOOu. 

Voltage  Rating t IS. OOOv, Pop <7620 

-UPl <0180-2338 , Mfr  tVarious. Style :Wet  Slug. Capacitance  Va i6S0 , OOOu, 

Voltage  Rating t 13 .OOOv, Popt 11000 

-UPl  <0180-2358, Mfr tVarious. Capacitance  Va  < 150 .OOOu. Voltage  Rating: 6 .OOOv. 
Popt4 

-UPl 10180-2374 , Mfr tVarious, Capacitance  Va < 100 . OOOu. Voltage  Ratingt20 .OOOv. 
Poptl9000 

-PI  tT370C225K025AS,MfrtKeiDet  Electronics  Corp ., Capacitance  Va<2.200u. 

Voltage  Ratlngt25.000v.Popt20S88 

-UPl t 0180-2 378, Mfr tVarious, Capacitance  Va t6 .eoOu. Voltage  Rating: 10. OOOv, 
Pop<41176 

-UPl <0180-2419,Kfr tVarious, Style: Wet  slug, Capacitance  Va< 470 .OOOu, 

Voltage  RatingtlO.OOOv, Pop<2952 

-UPl <0160-2474 .Mfr tVarious , Style tWet  slug. Capacitance  Va 1 15 . OOOu, 

Voltage  Ratlngt20.000v, Pop:20 

-UPl t0180-2466.Mfr tVarious , Style : Wet  slug. Capacitance  Va : 470 . OOOu. 

Voltage  Rating: 30 .OOOv, Pop t 14224 

-UPl :0180-2505. Mfr tVarious. Capacitance  Va:l .OOOu. Voltage  Rating:75 . OOOv. 
Pop:35284 

-PI :MMF-002-106A-20.Mfr<AVX  Corp .. Capacitance  Va:10.000u. 

Voltage  Rating :2 .OOOv. Pop : 32 

-UPl <0180-2514 , Mfr tVarious. Capacitance  Va :2 .200u. voltage  Rating : 20 .OOOv. 
Pop:220B 

-UPl <0180-2515 .Mfr tVarious. Capacitance  Va : 47 .OOOu. Voltage  Rating :6 . OOOv, 

Pop  <88028 

-UPl <0180-2545. Mfr  tVarious. Capacitance  Va : 100 . OOOu, Voltage  Rating : 4 .OOOv. 

Pop <50564 

-UPl :0180-2549.Mfr tVarious. Capacitance  Va ; 100 . OOOu. Voltage  Rating : 20 . OOOv. 
Pop:52B 

-UPl <0180 -2 597, Mfr : Various . Style : Wet  Slug. Capacitance  Va : 270 . OOOu. 

Voltage  Rating:2S .OOOv, Pop: 8952 

-PI ;MD7-00fl-476-20/9038.M£r:AVX  Corp .. Capacitance  Va:47.000u. 

Voltage  Rating: 8 .OOOv, Pop:7928 

-PI;MAZ-035-684-R20.Mfr:AVX  Corp . . Capaci cance  Va;.680u, 

Voltage  Rating; 35 . OOOv. Pop: 12 

-UPl : 0180-2610, Mfr : Various, Capacitance  Va ; 10 . OOOu .Voltage  Racing ; 75 . OOOv, 
Popt 18780 


0/0.0104 

0/2.3660 
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EPRD-97 


Part  Details  3-119 


Part  QuAlity  App  Data  Parc 

0«»c.  Laval  Env  sourca  Characcaristics 

Capacitor. Pixad, Elaccrolytic, Tantalum 

Cc3oinarcial  CB  13567-021-UPI 1 0180-2613  .KCr  iVarlout.  Capacltanca  Va  z  390  .  OOOu.  Voltaga  Rat  tng  s  6  .  OOOv. 
Popi4728 

-UP!  10180-2614, KCr I  Various . Capacltanca  Va 1 100 . OOOu. Voltaga  Rating : 30 . OOOv, 
Popi50940 

-PI 1MD7-010-226-20/9038 .Mf r jAVX  Corp .. Capacltanca  Vaj22.000u, 

Volcaga  Racing i 10 . OOOv. Pop ? 6148 

-UPl 10180-2616, K£ri Various. Capacltanca  Vat60.000u. Volcaga  Racing i6. OOOv. 
Popil0532 

-UPl  1 0180-2617, Kfr I  Various, Capacltanca  Va 1 6 . 8 OOu. Volcaga  Rad ng;35. OOOv. 
Popi344136 

-UPl i0l80-2618,K£r I  Various. Capacltanca  Va i 33 . OOOu. Voltage  Rat Ing 1 10 . OOOv. 
Popil72656 

-UPl 10180-2619 ,K£r i Various. Capacltanca  Va i22 . 00 Ou, Volcaga  Racing : IS . OOOv. 
Popil38eS6 

-UPl  1 01 80 -2 62 0,M£ri Various, Capacltanca  Va i2 . 2 OOu, Volcaga  Racing  t  50 . OOOv. 
Popi326404 

-UPl tOllO-2623 .K£ri Various, Capacltanca  Va 1 12 . OOOu, Volcaga  Racing  «  6 . OOOv. 

Pop i55l76 

-UPl 1 0180-2 66 l.Ktn Various , Capacltanca  Va 1 1 .OOOu, Volcaga  Racing 1 50 . OOOv, 
PopilB9368 

-UPl 1 0180-2662 .K£r I Various .Capacltanca  Va 1 10 . OOOu, Volcaga  Rating 1 10 . OOOv. 
Popill3368 

-UPl 10180-2664 , Kfr i Various , Capacltanca  Va i 3 . 3 OOu, Volcaga  Racing i 15 . OOOv. 
Popill72 

-UPl  1 0180-2667, K£r I Various, Capacltanca  Va 1 150 . OOOu. Voltaga  Rating  » 20 . OOOv. 
Popi3B076 

-PI jKD6-035-475-20/9038,MfriAVX  Corp. .Capacltanca  Vai4.700u, 

Voltaga  Raclngt 35 . OOOv, Pop i 66856 

-PI iHD7-008-336-20/9038,M£r «AVX  Corp .. Capacltanca  Vaj33.000u. 

Volcaga  Ratlngi 8 , OOOv. Pop i 1892 

-UPl  1 0180-2688 .Kfr I Various, Capacltanca  Va i 8 .200u. Voltaga  Rating j 20 .OOOv. 

Pop  1 12 

-UPl 1 0180-2690. Kfr I Various. Capacltanca  Va 1 3 . 3 OOu. Volcaga  Rating j 15 . OOOv. 
Pop«l76392 

-UPl  1 0180-2692 , Kfr I Various, Capacltanca  Va i 66 . OOOu, Volcaga  Rating 1 15 , OOOv, 
popt4ei2 

-UPl I  0180-2697, Kfr I Various, Capacltanca  Va 1 10 , OOOu, Volcaga  Rati ngi25. OOOv. 
Popi341376 

-UPl i0180-2698.Kfr I Various, Capacltanca  Va i4 .700u. Voltaga  Rating: 35 .OOOv, 
Popt28696 

-PI jK5-020-22S-10/9038.KfriAVX  Corp .. Capacltanca  Vaj2.200u. 

Voltaga  Ratlngi20 . OOOv. Pop i 30516 

-UPl I  018 0-27 4 3. Kfr I Various, Capacltanca  Va i . 1 lOu , Voltaga  Racing i 35 . OOOv, 
Popjll2e 

-UPl I  0180-2764 .Kfr I Various .Capacltanca  Vail. OOOu, Voltaga  Racing i 35 , OOOv. 
Popil700 

-PI IKD7-020-156-20/9038 ,Kf r lAVX  Corp Capacltanca  VailS.OOOu, 

Voltaga  Rating i 20 . OOOv, Pop i 33 436 

-UPl i0180-2781,Kfr I Various. Capacltanca  Va i 39 . OOOu, Voltaga  Rating  j 10 . OOOv. 
Popil2828 

-UPl I  0180-2794 .Kfr  t Various. Capacltanca  Va i 3 . 3 OOu , Voltaga  Rating i 35 . OOOv, 
Popi6960 

-UPl I  0180-2795 , Kfr I Various, Capacltanca  Va i 39 .OOOu. Voltaga  Rati ngilS. OOOv. 
Popi7480 

-UPl I  0180-2811 .Kfr I Various, Capacltanca  Va 1 10 . OOOu, Vol Caga  Racing i 35 .OOOv. 
Popil29l20 

-PI t394D684X0025B3,Kfrt3K  Co .. Capacltanca  Va i .680u. Volcaga  Rat Ing : 2 5 . OOOv, 
Popi8960 

-UPl  1 0180-2814 .Kfr I Various, Capacltanca  Va i22 .OOOu. Volcaga  Rating  1 10 . OOOv. 
Popi 87336 

-UPl 10180-2815, Kfr I Various. Capacltanca  Va i 100 . OOOu, Voltaga  Rating : 10 . OOOv, 
Poptl70696 

-UPl I  0180-2816 .Kfr I Various, Capacltanca  Va  t  68 . OOOu, Voltage  Rati ngilO. OOOv, 
Popi61604 

-UPl I  0180-2817 , Kfr I Various .Capacitance  Va  t  47 . OOOu. Voltage  Racing : 10 . OOOv. 
Popi43056 

-UPl I  0180 -2818. Kfr I Various. Capacltanca  Va i 2 . 200u , Vol cage  Rating : 35 . OOOv, 
Pop:227612 

-UPl t  0180-2819 .Kfr I Various, Capacitance  Va i , 470u. Volcaga  Rat Ing : 3 5 . OOOv . 
Pop:12580 

-UPl  10180-2820, Kf r  t Various , Capacitance  Va:.220u. Voltage  Racing; 35. OOOv, 
Popi22152 

-UPl I  0180-2821 . Kfr t Various. Capacitance  Va :22 , OOOu, Voltaga  Rati ng:35. OOOv, 
Popil91384 

-PI  I 550D476X0035S2 , Mf r i 3M  Co . . Capaci Canca  Va ; 4 7 . OOOu . Vol caga  Rat ing : 35 . OOOv. 
Pop: 1760 

-PI  1 109D826X9075F2 ,Mf r I  Sprague  Electric  Co .. Style i Wet  Slug, 

Capacitance  Va i 82 , OOOu . Vol cage  Rat Ing i 75 . OOOv, Pop i 4 3 €0 
-PI  1 109D106X0100C2 , Kfr : Sprague  Electric  Co .. Style iWec  Slug, 

Capacitance  Va 1 10 , OOOu , Vol caga  Rat Ing 1 100 - OOOv . Pop : 4088 
-PI  1 1090686X9 100T2 .KfrtSpragua  Electric  Co ,. Style : Wet  Slug . 

Capacitance  Va i 68 . OOOu . Vol tage  Ra t ing : 1 00 . OOOv, Pop : 4 700 
-UPl : 0180-2900. Mfr I  Various. Capacltanca  Va:15. OOOu, Voltage  Rati ng:75. OOOv, 
Pop:3S36 

-UPl i0180-2904.Kfri Various. Capacitance  Va: .lOOu, Voltage  Rati ng:75. OOOv, 

Pop  I  63  60 

-UPl :0180-2925,K£r! Various, Capacitance  Va:82. OOOu, Voltage  Rating; 10. OOOv. 
Popi 3884 


Fa 11 /Hours 
or  Miles  (E6) 

0/6.1464 
0/66  ,2220 
0/7.9924 
0/13.6916 
0/447.3768 
0/224.4528 
0/180.5128 
12/424.3252 
0/71.7280 
0/246.1784 
0/147.3784 
0/1.5236 
0/49.4988 
0/86.9128 
0/2.4596 
0/0.0156 
4/229.3096 
0/6.2556 
4/443.7888 
0/37.3048 
0/39.6708 
0/1.4664 
0/2.2100 
0/43.4666 
0/16.6764 
0/9,0480 
0/9.7240 
0/167.8560 
0/11.6480 
0/113.5368 
0/221.9048 
0/80.0852 
0/55.9728 
8/295.8956 
0/16.3540 
0/28.7976 
0/248.7992 
0/2.2800 
0/5.6680 
0/5 .3144 
0/6 .1100 
0/4.5968 
0/8.2600 
0/5-0492 
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3-120  Part  Details 


EPRD-97 


Part  Quality  App  Data  Part 

D«sc.  Laval  Env  Source  Characteristics 

Capacitor , Fixed, Electrolytic, Tan talun 

CoBwerclal  CB  13567-021-UPI 10180-2927. MfnVarious.StyleiWet  Slug. Capacitance  Vaj650.000u, 

Voltage  Rating j20 .OOOv. Popi 1316 

-UP#  iOieo-2929.Mtr {Various. Capacitance  Va :68 . OOOu, Voltage  Rating  1 10 . OOOv, 
Popil28252 

-UP#  sOieO-2944.Mfr{Varlous. Capacitance  Va : IS . OOOu, Voltage  Ratingt20 .OOOv. 
Popil2952 

-UP#  1 01 80-301 8, Mfr I Various. Capacitance  Va:4 .700u. Voltage  RatingtlO.OOOv. 
Popil664 

-UP# :0180-3020,Mf r {Various. Style iWet  Slug. Capacitance  Vat  120 . OOOu, 

Voltage  RatlngiSO.OOOv, Pop{21640 

-UP#  (0180-3051, Mfr {Various. Capacitance  Va 1 150 .OOOu. Voltage  Rating (6. OOOv, 
Popi2712 

-UP#  (0180-3073. Mfr I Various, Capacitance  Va 1 2 .200u, Voltage  Rating (30. OOOv. 
Popil26e8 

-UP#  tOlBO-3074. Mfr {Various, Capacitance  Vai 15. OOOu, Voltage  RatlngtSO.OOOv, 

Pop  1 24760 

-UP# 1 0180-3084. Mfr {Various. Capacitance  Va (22. OOOu, Voltage  Rating{20.000v, 
Pop{4604 

-UP#  1 0180-3087.Mfr{ Various. Capacitance  Vail.500u, Voltage  Ratlng{60.000v. 
Pop{7ll2 

-UP# (018 0-3 13 5. Mfr {Various. Capacitance  Vat  10. OOOu, Voltage  RatlngilO.OOOv. 
Pop{16 

-UP# {0180-3143. Mfr (Various. Capacitance  Vat 3. 300u. Voltage  Ratlngt75.000v, 
Pop{20056 

-UP# (0180-3210, Mfr (Various, Capacitance  Va (27 .OOOu, Voltage  RatingtlO.OOOv. 
Pop(2040 

-UP#  1 0180-3319. Mfr (Various, Capacitance  Vat22. OOOu, Voltage  Rating 1 50. OOOv, 
Pop{688 

-P#tl5lD236X9035Z2,Mfrt3M  Co. .Capacitance  Va t23 .OOOu, Voltage  Rating i 35. OOOv. 
Pop{424 

-UP#  (0180-3 422, Mfr (Various, Capacitance  Vat  10. OOOu, Voltage  RatingtlO.OOOv, 

Pop (5176 

-UP#  1 0180-3440. Mfr (Various, Capacitance  Vat 47. OOOu. Voltage  RatingtlO.OOOv, 
Poptl2644 

-UP#  1 0180-3487, Mfr (Various, Capacitance  Vat  150. OOOu, Voltage  Rating 1 16 .OOOv. 
Poptll544 

-UP# (0180-3573, Mfr (Various, Capacitance  Vat  10. OOOu, Voltage  Rating 1 35 .OOOv. 
Poptl984 

-UP# (0180-3574. Mfr (Various. Capacitance  Vat 3 .300u, Voltage  RatingtlO.OOOv. 
Pop(2652 

-UP# 10180-3629, MfriVarious, Capacitance  VatlS. OOOu, Voltage  Ratlngi35.000v. 
Popil8960 

-P# (T36eC476K020AS,MfrtKemet  Electronics  Corp. .Capacitance  Vat 47. OOOu. 
Voltage  Ratlngt20 .OOOv, Popt 8932 

-P« (19601164. Mfr (3M  Co. .Capacitance  Va t 2 .200u. Voltage  Ratlngt75.000v. 
Poptl6684 

-P# tT350G156M025AS,HfrtKeaet  Electronics  Corp. .Capacitance  Vat  15. OOOu. 
Voltage  Ratlng(25 .OOOv, Popt 9968 

-UP#  (0180-3741, Mfr (Various, Capacitance  Vai2 .200u, Voltage  Ratlngi25.000v, 
Popil6332 

-UP# 1 0180-3743. Mfr (Various, Capacitance  Vat .680u. Voltage  Rating (50 .OOOv. 
Pop(ie9S2 

-UP#  (0180-3744, Mfr (Various, Capacitance  Va(4 .700u, Voltage  RatingtlO.OOOv, 
Poptl8952 

-UP#  (0180-374 9, Mfr (Various, Capacitance  Vat .220u, Voltage  Rating 1 35. OOOv. 
Popt360 

-UP#  (0180-3750, Mfr I Various, Capacitance  Va 1 4 .700u. Voltage  Rating 1 35 .OOOv. 
Poptl20 

-UP# (0180-3751. Mfr (Various. Capacitance  Va t 1 .OOOu. Voltage  Ratlngt3S.000v, 
Popt39780 

-UP# 10180-3752, Mfr (Various, Capacitance  Va t 3 . 300u, Voltage  Rating i 35 .OOOv. 
Popt2868 

-UP# (0180-3753 . Mfr (Various, Capacitance  Va 1 6 . 800u, Voltage  Rating (35 .OOOv. 
Pop(33464 

-UP# (0180-3754, Mfr (Various. Capacitance  Vat  10. OOOu, Voltage  Rating (25. OOOv, 
Pop(69140 

-UP# (0180 -3755. Mfr (Various. Capacitance  Vat 3 3. OOOu. Voltage  RatingtlO.OOOv. 
Poptl536 

-UP# (0180-3767, Mfr (Various. Capacitance  Va (3 .300u. Voltage  Rating;25 .OOOv. 
Poptl4952 

-UP#  (0180-3768. Mfr (Various. Capacitance  Va  1 3 . 300u. Voltage  Rating ; 35 . OOOv. 
Pop(70648 

-UP# (0180-3769,Mfr {Various. Capacitance  Va ;6 . 800u. Voltage  Rating : 35 . OOOv, 
Pop(37704 

-UP# (0180-3770, Mfr (Various. Capacitance  Va i2 .200u. Voltage  Raclng:35.000v. 
Pop:108620 

-UP# (0180-3771. KfrtVarlous, Capacitance  Va : 1 .OOOu, Voltage  Ratingi 35 . OOOv. 

Pop (3 17892 

-UP# (0180-3772, MfrtVarious. Capacitance  Va : 10 .OOOu, Voltage  Racing : 25 .OOOv, 
Pop: 1728 

-UP#  ( 0180-3773. Mfr (Various. Capacitance  Va : .470u. Voltage  Rating : 35 .OOOv. 
Popt2860 

-UP# {0180-3775.Mfr (Various, Capacitance  Va : 68 . OOOu, Voltage  Rating : 10 . OOOv. 
Pop:167132 

-UP# (0180 -3784. Mfr (Various. Capacitance  Va : 22 . OOOu. Voltage  Rating : 25 . OOOv. 
Pop (371876 

-UP# (0180-3803, Mfr :3M  Co .. Capacitance  Va : 6 . 800u. Voltage  Rating ;75 .OOOv, 

Pop: 12000 


Fail /Hours 
or  Miles  (E6) 

0/1.7108 

0/166.7276 

0/16.8376 

0/2.1632 

12/28.1320 

0/3.5256 

0/16.4944 

0/32.1880 

0/5.9852 

0/9.2456 

0/0.0208 

0/26.0728 

0/2.6520 

0/0.8944 

0/0.5512 

0/6.7288 

0/16.4372 

0/15.0072 

0/2.5792 

0/3.4476 

0/24.6480 

0/11.6116 

0/21.6892 

0/12.9584 

0/21.2316 

0/24.6376 

0/24.6376 

0/0.4680 

0/0.1560 

0/51.7140 

0/3.7284 

0/43.5032 

0/89.8820 

0/1,9968 

0/19.4376 

0/91.8424 

0/49.0152 

0/141.2060 

0/413.2596 

0/2.2464 

0/3.7180 

0/217.2716 

0/483.4388 

0/15.6000 
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3-130  Part  Details 


EPRD-97 


Parc 

D«sc. 


Quality 

Laval 


App  Data 
Env  Source 


Part 

Characteristics 


Capacitor. Fixed, Electrolytic.  Tantalum 

Military  CF  14851-000-P# iCSR13BC105K, Mil • jK3 9003 -0 1 -23 S7 .NSN : 5910-00-495-0042 , Kfr j Various . 

PKg . iKetal.Henn. t Hermetic , Polar Ity: Polar  lied. Tol . j ♦/-20, 

Capacitance  Va i 1 . OOOu, Voltage  Racing j 50 . OOOD. Pop » 1161 
-PI 1CSR13E475K.M11I «M39003 -01-2368 . NSM i 5910-00-007-2004 . Mfr ; Various . 
Pkg. 1 Metal. Harm. i Hermetic, Polar  Icy : Polarized. Tol . i ♦ /-lO. 

Capacitance  Va j 4700000 . OOOp. Voltage  Rat ing : 50 . GOOD. Pop i 258 

-PI«CSR13C475KM.Mil# tM39O03-Ol-2368.NSN;591O-Ol-O86-llOB,Mfr jVarlous, 
Pkg,  iMetal.Herm.  iHermetic.  Polarity i Polarized. Tttl .  i*/-20. 

Capacitance  Va i 4700000 . OOOp, Voltage  Rating :50 . GOOD. Pop: 516 
-PI ICSR13C156MM. Mil# IM39003 -01-2378. MSN, 5910-00-137-7584. Mfrj Various. 

Pkg. iMetal.Herm. i Hermetic, Polarity i Polarized. Tol . i */-20. 

Capacitance  VailS.OOOu, Voltage  Rat Ing i 50 . GOOD. Popi 129 

-PI  iCSR13C685KM.HllliM39003-0l-3024.NSNi5910-00-137-7584.Mtr  I Various. 
Pkg. iMetal.Herm. iHermetic . Polarity i Polarized. Tol . i ♦/-lO. 

Capacitance  Va t6800000 .OOOp, Voltage  Ratingi35.000D. Popil29 
-PI .CSR13C335KM.Mill iK39003 -03 -003 6 , NSN i 5910 -00-010-8159 . Mfr i Various , 
Style iCSR.  Pkg.  iMetal.Herm.  iHermetic.  Polarity  i Polarized. TXjI  .  i*/-20. 
Capacitance  Va i33 , OOOu, Voltage  Rating i 15 . OOOD. Popi 129 

-PI 1CSR13C685KM, Mil I iM39003-01-3024.NSNi 5910-00-144 -43 81. Mfr I Various. 
Style iCSR. Pkg. iMetal.Herm. iHermetic. Polarity i Polarized. Tol .  i*/-10. 
capacitance  Vai6800000. OOOp, Voltage  Ratingi 35 .GOOD. Popi25B 

-PI. CSW3C475KK.Kill. 109003-01-2255, NSN. 5910-00-418-9775. Ktr.V.rloui 

V.. 4700000. OOOP, 

Capacitance  Va 1 33  - OOOu. Voltage  Ratingi 10 .0000, Pop 1 129 
'd! NSNi 5910-00-105-1976. Mfr iVarlous 
Pkg. iMetal.Herm. iHermetic, Polarity i Polarized. Tol . i*/-l0, 

Capacitance  Va i 39 .OOOu. Voltage  Rating i 10 . OOOD. Popi 129 

5910-00-550-1901. Mfr.  Various 
Pkg. iMetal.Herm. i Hermetic. Polarity « Polarized, Tol . i ♦/ -20 . 

Capacitance  Va i 100 .OOOu, Voltage  Rat ing i 10 . OOOD. Pop i 129 
'd! NSNi 5910-00-165-0629. Mfri Various 
Pkg. iMetal.Herm. iHermetic. Polarity i Polarized, Tol . i^/-20 

fUclngil5.000D.Popi387 

'D!'^^”^P^^*^'**^^***^^°°^"‘^^*^^®^'”^‘5910-00-007-2002.MfriVarious 
Pkg. iMetal.Herm. « Hermetic, Polarity i Polarized. Tol . i>/-10. 

Capacitance  Va  i2200000  .  OOOp,  Voltage  Rating  1 20  ,  OOOD  EN3oi25a 

-PI.CSRUE1568N,Mlll:H39003-01-22B9,NSN,59?0-Si:«2:4^5”L,V.rlous 
PKg.  iM«c»l,H«rn.  iH.riMtlc,  Pol»rlcy!F>ol*rli»<l,T01.  i*/-10. 

Capacitance  Va ; 15 . OOOu. Voltage  Racing ; 20 . OOOD, Pop i 64 5 

‘5910-00-283-3092,  Mfr.  Various 

Pkg..N.c.l.H.r»..H.nn«clc,i.ol.rlcy,PoUrlz«J,'K,l.,./-20 

Capacitance  Va : 13 . OOOu, Voltage  RaCing.20.000D, Popi258 

Pkg. iMetal.Herm. iHermetic, Polarltyipolarlzed. Tol. .♦/-lO 
Capacitance  Va . 47 . OOOu. Voltage  Racingi20. OOOD, Pop. 129 

'■2296,  NSNi59l0-00-113-5689.  Mfr  .Various 

Pkg.  IMetal.Herm.. Hermetic.  Polarity. Polarized. 'rk>l.,./-20 

vai47. OOOu.  Voltage  Rat  ing :  20 .  OOOD.  Pop .  129 

5910-00-465-0312, Mfr.  Various 
Pkg. iMetal.Herm. iHermetic. Polarity i Polarized. Tol . t>/-l0 

Racing.  35.  OOOD.  Pop  1774 

‘Various. 

Pkg. . Metal. Herm. iHermetic. Polarity « Polarized. Tol . .♦/-20 
Capacitance  Va . 47 .OOOu. Voltage  Rating : 35 , OOOD. Pop i 3 07 

*2344,  NSNi  5910-00-866-3153.  Mfr 'Various 
Pkg. iMetal.Herm. iHermetic . Polarity . Polarized. Tol . .♦/-lO. 

Capacitance  Va . 220000 . OOOp. Voltage  Racing ; 50 . OOOD. Pop . 64 5 


Fall/Hours 
or  Miles  (E6) 

2/28.8183 

0/6.4041 

0/12.8081 

0/3.2020 

0/3.2020 

0/3.2020 

0/6.4041 

0/3.2020 

0/3.2020 

0/3.2020 

0/3.2020 

0/9.6061 

0/6.4041 

0/16.0102 

0/6.4041 

0/3.2020 

0/3.2020 

1/19.2122 

0/9.6061 

0/16.0102 
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AN  APPLICATION  OF  PROFILER  FOR  MODELING 
THE  DIFFUSION  OF  ALUMINUM-COPPER  ON  A  SILICON  SUBSTRATE 


Matthew  E.  Edwards 
Associate  Professor  of  Physics 
Department  of  Physics 
Spelman  College 


Abstract 

Electromigration,  the  undesirable  movement  of  constituent  atoms  of  the  medium, 
is  a  deleterious  phenomenon  for  microcircuits  and  interconnects.  In  both  cases,  the 
electrical  currents,  of  the  involved  circuits,  literally  induce  and  perpetuate  the  condition  of 
atomic  migration.  To  better  understand  this  phenomenon,  and  to  seek  ways  to  contain  or 
arrest  its  progression,  systematic  considerations  are  needed  to  measure  and  predict  its 
occurrence,  and  methodologies  are  needed  to  halt  its  intrusion.  Both  theoretical  and 
experimental  considerations  of  interdififiasion,  under  thermal  stressing,  are  viable  activities 
for  addressing  electromigration.  In  regards  to  theoretical  considerations,  this  research  has 
determined  that  the  fixed-code  program,  PROFILER,  is  grossly  inadequate  and 
inapplicable  for  describing  Auger  Spectroscopic  data  of  thin  films  of  Al-Cu  on  silicon 
substrates.  However,  the  program  is  applicable  to  idealized  situations  of  flat  interfaces 
and  constant  functional  forms.  Therefore,  initial  efforts  have  been  made  to  develop 
PROFILER  11,  a  more  powerful  and  extensive  computer  program  that  would  be  able  to 
handle  variable  interfaces,  with  the  inclusion  of  a  data  smoothing  component  through 
convoluted  averaging.  To  that  end,  essential  issues  of  non-equilibrium  statistical 
mechanics  have  been  reviewed,  studied,  and  described.  The  U.S.  military  protection  of  its 
citizens,  as  provided  by  the  Air  Force,  will  be  enhanced  by  having  its  electronic  devices 
guarded  against  electromigration,  which  can  be  understood  through  the  anticipated 
capabilities  of  the  computer  code,  PROFILER  11. 
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AN  APPLICATION  OF  PROFILER  FOR  MODELING 
THE  DIFFUSION  OF  ALUMINUM-COPPER  ON  A  SILICON  SUBSTRATE 


Matthew  E.  Edwards 


Introduction 

Interdiffiision  is  an  important  process  in  reliability  physics.  It  is  the  process  that  is 
occurring  when  the  components  in  connecting  alloys  diffuse  across  the  region  of  coupling. 
This  process  finds  itself  applicable  in  issues  associated  with  interconnects,  composite 
materials,  high-temperature  coatings,  and  thin-film  devices  [1-6].  Its  importance  for  the 
reliability  of  interconnects  is  in  the  prevention  or  at  the  very  minimum  the  prediction  of 
electromigration,  where  the  latter  is  an  undesirable  phenomenon  of  atomic  migration.  The 
region  of  atomic  migration  is  referred  to  as  the  diffusion  layer.  This  layer  is  often  the 
region  where  corrosion,  electrical  anomalies,  embrittlement  and  other  deleterious 
processes  occur.  In  the  cases  of  high-temperature  coatings  and  thin-film  devices,  for 
instance,  interdiffiision  can  lead  to  early  electronic  device  failures.  Interdiffusion,  as 
brought  on  by  electromigration  from  currents  in  circuits,  can  also  lead  to  device  failure. 
Therefore,  there  is  a  need  to  better  understand  the  nature  of  metallic  diffusion,  and  to  have 
predictive  analyses  of  the  difflisants’  concentrations  as  a  function  of  time  and  penetration 
depth. 

In  this  investigation,  the  applicability  of  PROFILER  has  been  extensively  studied 
with  the  essential  outcome  that  this  fixed-code  program  is  inappropriate  for  describing 
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variable-interfaced  thin  films  of  Al-Cu  on  a  silicon  substrate  as  provided  by  Auger 
spectroscopy.  Therefore,  initial  efforts  have  been  made  to  develop  PROFILER  11,  such 
that  the  latter  will  be  able  to  handle  variable  interfaces,  with  a  component  for  smoothing 
the  spectroscopic  data. 

Discussion 

Since  the  application  of  PROFILER  to  flat  interfaces  is  a  valid  and  a  significant 
operation,  the  details  of  its  features  are  presented  in  the  following.  Each  item  is  described 
to  varying  degrees  (See  Table  1.  for  a  layout  of  PROFILER’S  main  menu): 

1.  Load  -  This  sub-menu  item  reactivates  existing  or  previously  prepared  files  into  the 
operating  program.  Pressing  <Enter>  displays  on  the  screen  all  file  names  that  have  been 
saved  in  prior  sessions.  Select  the  desired  file  and  press  <Enter>. 

Note:  The  Load  entry  should  not  be  selected  until  files  have  been  appropriately  saved. 
If  selected  before  files  are  saved,  the  program  gives  an  error  message. 

(The  user  should  start  with  the  New  screen,  as  described  later,  to  avoid  this  problem). 

2.  Save  -  This  item  saves  the  current  data.  The  program  prompts  the  user  to  enter  a  file 
name  having  up  to  8  letters  or  numbers.  In  all  cases,  saved  information  items  or  selections 
to  PROFILER  are  implemented  by  pressing  <F2>. 

3.  Save  As  -  This  item  allows  the  user  to  change  the  name  of  an  existing  file  that  is 
currently  running.  The  existing  file  name  appears  after  Save  As  on  the  main  menu. 
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Table  1 :  PROFILER’S  Main  Menu 


Files 

1.  Load 

2.  Save 

3.  Save  As 

4.  New 


Data 

^Granhics 

* 

7.  System  Information 

15.  **Diffusion  Time 

1 

8.  ****L  matrix 

16.  Concentration  Differences 

1 

8.1  Frequency  Factors 

17.  ***Display  Concentration 

2 

8.2  Activation  Energies 

Profiles 

2 

5.  Exit  (save  first) 


8.3  Tracer  DifTusivities 


2 


6.  Quit  (no  save)  8.4  L  matrix 

9.  ****G  matrix 

9.1  Regular  Solution 
Parameters 

9.2  G  matrix 

I.  D  matrix 

II.  r  mEigenvectors  of  r 
12.  Alpha  matrix 


*Selectional  only  after  System  Information  fill-in  sheet  and  the  D  matrix  have  been 
completely  implemented. 

**A  diffusion  time  always  exist,  having  a  default  value  of  360,000  seconds. 
***Further  selective  only  after  concentration  Differences  have  been  specified. 
****Has  second  level  menu  items  as  shown. 
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4.  New  -  This  item  is  used  to  create  a  new  file.  It  should  be  the  first  selection  made  for  a 


beginning  session  with  PROFILER  When  selected,  a  new  System  Information  Screen  is 
displayed  and  the  user  must  enter  the  following  information  about  the  difiusion  couple: 

a.  The  number  of  alloying  elements  (diffiisants)  in  the  couple.  The  program  has 
a  default  value  of  2. 

b.  The  temperature  ( in  Kelvin)  at  which  diffusion  takes  place.  The  program 
has  a  default  value  of  1 500K. 

c.  Average  mole  fractions  of  the  solutes  in  the  couple.  Note:  The  program  cal¬ 
culates  the  average  mole  fraction  of  the  host  element,  from  information  about  the 
solutes. 

d.  Abbreviations  for  each  alloying  element  (optional). 

e.  MO  (Structural  Factor)  -This  item  has  a  default  value  of  zero.  It  gives  inform¬ 
ation  on  the  diffusion  steps  which  may  go  backwards  rather  than  forward.  Typical 
values  are  as  follows: 

Simple  cubic  -  3.77  Body  Centered  Cubic  -  5.33 

Face  Centered  Cubic  -7.15  Diamond  -2.00. 

If  the  appropriate  information  is  not  entered  on  the  System  Information  Screen,  the 
program  gives  an  error  message  and  automatically  stops  running. 

5.  Exit  tsave  first!  -  This  item  will  exit  the  program  and  save  the  data  under  the  user's 
specified  filename. 
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6.  Quit  (no  save')  -  This  item  will  exit  the  program  without  saving  the  entered  data.  The 
program  has  a  safety  feature  which  prompts  the  user  to  save  the  data  before  exiting  the 
program. 

7.  System  Information  -  This  item  is  the  same  as  that  available  under  #  4  above.  It  gives 
pertinent  information  about  the  current  diffusion  couple,  and  can  be  selected  at  any  time 
while  using  the  program. 

8.  L  matrix  -  This  item  is  where  tracer  diffusion  data  are  entered. 

8.1  Frequency  factor  (A')  -  (Not  considered  in  this  investigation.  The  program  runs  and 
gives  satisfactory  results  without  specifying  the  frequency  factor.) 

8.2  Activation  energies  tOl  -  The  typical  energies  for  migration  of  involved  atoms 

8.3  Tracer  diffusivities  fPV  -  The  typical  diffusion  coefficients  of  involved  atoms 

8.4  L  matrix  -  (Not  considered  in  this  investigation.  The  program  runs  and  gives 
satisfactory  results  without  specifying  the  L  matrix.) 

9.  G  matrix  -  This  is  a  pull-down  menu  to  enter  thermodynamic  information  about  the 
diffiisants  or  atoms. 

9. 1  Regular  Solution  Parameters  -  These  values  can  be  approximately  related  to  heats  of 
mixing,  ,  mole  fractions,  ,  and  other  measured  thermodynamic  quantities  by 
various  formulas,  e.g.: 

9.2  G  matrix  -  The  second  derivative  of  the  free  energy  are  entered  here  in  units  of  J/mole, 
where 
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G 


10.  D  matrix  -  The  diffusivity  matrix  elements  [6-8]  are  entered  here  if  they  are  known. 

1 1 .  r  matrix  -  The  square  root  diffusivity  matrix  [r]  is  entered  here. 

Note:  If  L  and  G  matrices  are  provided,  then  PROFILER  will  automatically 
calculate  D  and  r.  If  D  or  r  is  provided,  the  program  will  calculate  the  one 
that’s  not  entered.  The  calculated  diffusivities  appear  on  their  respective 
menu  screens. 

12.  Eigenvalues  of  r  -  The  square  roots  of  the  eigenvalues  of  D  are  displayed 
(i.e.  ^.  =  VA  )• 

13.  Eigenvectors  of  r  -  The  a"'  matrix  is  displayed.  Columns  of  this  matrix  are 

eigenvectors  of  both  D  and  r.  The  a  and  a”'  matrices  diagonalize  D  by  the 
transformation,  Z),  =  oDa"' . 

14.  Alpha  matrix  -  This  item  displays  the  a  matrix. 

15.  Diffusion  Time  -  The  isothermal  heat  treatment  time  is  entered  in  units  of  seconds.  The 
program  has  a  default  time  of  360,000  seconds  ( 100  hours). 

16.  Concentration  Differences  -  The  initial  concentration  differences  between  the  diffus¬ 
ing  couple  are  entered.  The  values  are  obtained  by  subtracting  concentrations  on  the 
left  side  of  the  coupling  interface  from  those  on  the  right  side. 

17.  Displays  Concentration  Profile  -  The  monitor's  screen  shows  the  concentration  profiles 
of  each  solute  (difflisant).  The  axes  variable  are  the  followings: 
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X  axis  -  Gives  the  distance  from  the  initial  interface  (a  distance  of  10  |im  is 
between  graduations  marks  on  the  X  axis). 

Y  axis  -  Gives  the  difference  in  concentration  between  the  local  concentration  and 
the  average  concentration  for  each  solute.  ( 0.2  of  units  of  the 
concentration  are  between  graduations  on  the  Y  axis). 

The  purpose  of  the  Numeric  menu  is  to  create  a  file  of  concentration  profiles  in  a  tabular 
form.  This  file  can  then  be  imported  into  a  spread  sheet  or  plotting  program  to  obtain  a 
hard  copy  [9]  of  the  concentration  profiles  or  to  construct  diffusion  paths. 

18.  Diffusion  Time  -  Displays  the  time  as  entered  in  #  15  above. 

19.  Concentration  Differences  -  Displays  the  initial  concentration  differences  as  used  to 
calculate  the  concentration  profiles. 

20.  View  matrix  A  -  The  concentration  profile  for  an  n  component  diffusion  couple  is 
given  by  the  sum 

C, (x,t)  =  C,'  - 2 A„erfc{x / (l^DA)) , 

i=l 

where  Cf  is  the  initial  concentration  of  solute  i  in  the  alloy  on  the  right  side  of  the  initial 
interface,  and  Dj  is  one  of  the  n-1  eigenvalues  of  the  D  matrix  [5-8].  The  D  matrix  gives 


the  presentation  of  the  Ay  coefficients  of  the  complementary  error  function,  erfc(; 


2yfDi^' 


21.  Generate  Numeric  Data  —  Commas  -  This  item  creates  a  file  with  numeric  data 
separated  by  commas. 
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22.  Generate  Numeric  Data  —  Spaces  -  This  item  creates  a  file  with  numeric  data 
separated  by  spaces. 

The  format  of  the  output  file  is  the  following: 


column  1 

column  2 

column  3 . 

. column  n. 

-0.00100 

0.2 

0.3 

-1.4 

Column  1  gives  x  axis  values  in  centimeters  (x  =  0  is  the  original-finite  interface  between 
the  alloying  couple). 

Column  2  gives  concentration  differences  for  the  first  solute  between  the  local 
concentration  and  the  average  initial  concentration  ,  where 

+C^ 

AC,=Q{x,t)-^Y^. 

( The  concentration  difference  is  always  zero  at  x  =  0 ). 

Column  3  gives  the  concentration  differences  for  the  second  solute. 

Column  n  gives  the  concentration  differences  for  the  n-1  solute. 

Using  the  above  menu  items,  PROFILER  has  been  studied  to  asset  its  ability  to 
depict  the  non-equilibrium  movement  of  atomic  particles  (Aluminum  and  Copper  Atoms) 
on  a  Silicon  substrate.  The  program  functions  adequately  for  flat  interfaces.  However,  real 
coupling  at  interfaces,  as  represented  by  Auger  Spectroscopic  data,  is  such  that  it  is  rarely 
if  ever  completely  flat.  Thus  the  question  becomes  -  would  it  be  economically  feasible  to 
write  a  differ  computer  code  for  the  varied  interfacial  configurations?  We  supplied  a  yes  to 
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this  question,  and  set-about  initiating  the  development  of  such  a  program.  To  that  end,  we 
reviewed  non-equilibrium  statistical  mechanics,  considering  such  issues  of  time  correlation 
functions,  relaxation  to  equilibrium  and  stochastic  processes. 

The  following  table  gives  essential  information  on  A1  and  Cu  that  has  been  useful 
in  the  development  of  the  extended  computer  program.  It  gives  data  on  coordination 
numbers,  lattice  parameters,  etc. 


Density 

Atomic 

Crystal 

Lattice 

Distance 

Melting 

Coordin- 

Name 

(g/cm3) 

Number 

Structure 

Para- 

meter(A) 

of 

Closest 

Appro¬ 

ach 

Point(  C) 

ation* 

A1 

2.70 

13 

FCC,  A1 

2.862 

660 

12 

Cu 

8.90 

29 

FCC,  A1 

EEBuH 

2.556 

1083 

12 

Table  2.  Essential  Properties  of  A1  and  Cu.  *Coordination  number  is  the  number  of  nearest 
neighbor  atoms. 


Methodology 

The  outcome  from  PROFILER  may  be  obtained  from  one  of  two  different  modes 
of  operation,  depending  on  the  type  and  amount  of  information  that’s  initially  known 
about  the  diffusing  couples  [9-20].  Table  3,  below,  outlines  the  two  different  modes  of 
operation.  PROFILER  II  should  be  developed  with  similar  modes  of  operation. 


_ mmG  PROFILER  TABLE  3 _ 

_ Method  I. _ Method  n _ 

Pre-Determined  Diffiisivity  Method  Undetermined  Difflisivity  Method 

1 .  System  Information 
a.  Number  of  Components  in  the 
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interdiffusion-couple,  n  has  the 
value:  ( 1  <  n  <  8  ),  and  must  be 
the  first  entry. 


(Same  as  Method  I  for  a  -  d.) 


b.  Temperature  in  Kelvin  at  which  the 
diffusion  process  occurs. 

c.  Average  Mole  fraction  (at.  Pet.)  of 
each  solute  between  the  left  member  and 
the  right  member  of  the  interdiffusion 
couple.  (At  this  point  the  program 
calculates  the  average  fraction  of  the 
solvent,  the  "host"  element). 

d.  Enters  abbreviations  of  alloying 
element,  (optional  entry  -  but  helpful  for 
accountability) 

e.  Structure  Factor  (MO)  Unnecessary 
Entry  for  this  method.  (The  appropriate 
value  has  already  been  considered  in  the 
Pre-determination  of  the  diffusivity) 


2.  Difflisivities  Not  entered. 


3.  Diffusion  Time  in  seconds 

4.  Concentration  Differences 
Enter  the  concentration  of  right 

member  of  the  interdiffusion  couple  minus 
the  concentration  of  the  left  member. 

5.  Displays  Concentration  differences 


e.  Structure  Factor  (MO)  As  determined 


from  diffusion  theory; 

1.  simple  cubic  -  3.77 

2.  body  centered  cubic  -  5.33 

3.  Face  centered  cubic  -  7.15 

4.  Diamond  -  2.00 


2.  a.  Tracer  difftision  Values  (Highlight  L 
matrix  on  menu  bar).  Pre-exponential  (A) 
units  of  cm^2/sec,  Activation  Energies  (Q) 
units  of  kcal/mole 


b.  Thermodynamic  Data  -  Heat  of  mixing 
delta  H^mix,  Mole  fraction  of  N_I  and  NJ 
of  solutes 

3.  (Same  as  method  I) 

4.  (Same  as  method  I) 


5.  (Same  as  method  I.) 


(difference  between  the  absolute 
concentration  minus  the  average  as  entered 
for  the  diffusion  couple): 

Ordinate  axis,  in  units  of  at.  Percent 
Abscissa  axis,  in  units  of  cm. 


With  the  above  procedure  understood  on  just  how  to  use  PROFILER,  our  efforts 
became  one  of  developing  directly  and/or  obtaining  from  the  scientific  literature  a  scheme 
for  handling  the  variation  in  the  coupling  region  and  the  overall  lack  of  a  step  function 
behavior  for  the  diffusants  of  the  system,  copper  and  aluminum  on  a  silicon  substrate.  A 
scheme  to  reduce  the  random  fluctuations  in  the  Auger  Spectra  was  obtained  and  used[21- 
25].  This  scheme  used  the  method  of  convoluted  averaging  where  the  convolution 
function  was  obtained  from  a  least  squares  procedure.  This  resulted  in  an  invaluable 
smoothing  of  the  data  but  not  in  an  overall  reduction  in  the  interfacial  variation  and/or  the 
lack  of  a  constant  function  behavior  for  the  concentrations  away  from  the  interface.  Fig.  1 
depicts  the  dilemma  of  the  problem  showing  the  typical  nature  of  the  Auger  Data.  The 
concentration  function  for  A1  is  fairly  constant  from  a  depth  of  zero  nanometers  (nm)  up 
to  900  nm.  Then  the  function  varies  decreasingly  in  the  cross-over  region  (a  distance  of 
w  200  nm).  This  distance  is  more  than  600  times  the  distance  of  closest  approach  in  the 
aluminum  structure. 
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Concentration  as  Annealed  at  250  C 
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Fig.  1.  Shows  the  Concentration  Profiles  as  a  Function  of  Depth  into 
the  Sample. 

This  yields  a  transitional  interface  of  some  200  nm,  as  opposed  to  an  abrupt  situation.  For 
such  a  thin  film,  this  variational  region  is  too  large  to  be  considered  as  being  insignificant 
in  size.  Also,  Fig.  1  shows  that  the  presence  of  copper  sets-in  at  a  depth  of  about  900  nm 
and  continues  over  through  1350  nm,  in  the  shape  of  a  peaking  function  at  about  1 100  nm. 
The  presence  of  silicon  begins  to  show  up  at  a  sputtering  depth  of  700  nm,  out  to  and 
beyond  1400  nm.  Oxygen  and  carbon  are  minimally  present  as  impurities.  Fig.  2  is  a 
reproduction  of  Fig.  1  with  the  sputter  time  (at  a  rate  6.3  nm/min)  as  the  abscissa. 
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Original  Concentration  Data  As  Annealed  at  250  C 


Sputter  Time  (min) 

Fig.  2.  Shows  the  Concentration  Profiles  As  Produced  By  the 

Auger  Spectrometer 

From  the  typical  Auger  Spectra,  it  became  self-evident  that  our  efforts  could  be  better 
served  by  initiating  the  procedure  for  developing  PROFILER  II,  a  computer  program 
having  greater  versatility  in  general  interfacial  regions  and  allowing  for  non-constancy 
away  from  the  interfaces. 

To  that  end,  the  literature  was  explored  in  non-equilibrium  statistics  (transport 
theory),  equilibrium  statistics  and  thermodynamics  for  procedures  of  obtaining  and 
applying  the  dififiision  equation  to  various  interfacial  situations.  The  literature  [26-41] 
was  discovered  to  be  replete  with  information  on  the  diffusion  equation  as  it  is  used  in 
simple  geometrical  situations.  But  not  much  details  were  observed  on  comparative 


9-15 


derivations.  Atlas!  We  undercover  the  idea  that  diffusion  is  just  another  depiction  of  the 
random  -  walk  problem  and,  as  such,  the  diffusion  equation  can  be  derived  from  this  point 
of  view,  with  the  continuation  of  solving  the  same  in  very  general  geometrical  situations. 

Therefore,  we  have  obtained  the  derivation  of  the  diffusion  equation  in  one 
dimension  in  the  following.  The  derivation  is  succinct;  yet  complete.  It  is  presented 
below. 

The  non-equilibrium  process  of  diffusion,  in  a  liquid  or  solid,  as  in  a  gas,  may  be 
calculated  from  a  random  -  walk  description.  However,  the  physical  model  is  somewhat 
different  for  a  liquid  or  a  solid.  In  a  solid,  in  particular,  a  diffusing  particle  is  at  an 
equilibrium  position  with  respect  to  the  forces  exerted  by  neighboring  particles.  Nearby,  to 
the  site  in  question,  there  are  m  other  similar  positions  or  sites,  separated  from  the  given 
position  by  a  potential  barrier,  which  determines  the  probability  per  unit  time  that  the 
particle  jumps  to  one  of  these  available  sites.  Let  ^  be  the  distance  of  the  /th  new  position 
from  the  given  equilibrium  position,  let  p,  be  the  probability  that  a  particular  jump  is  to 
the  /th  new  position,  and  let  be  the  angle  between  the  x  -  axis  and  the  vector  from  the 
given  position  to  the  /th  position.  The  average  distance  a  particle  moves  in  the  x  direction 
as  a  result  of  the  /ih  jump  is 


d,=<x>= 


'ZpA  cos  9, 


•  1=1 


=<  ,^cos^  > 
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where  the  fences  o  denote  ensemble  averaging.  This  value  is  zero  if  there  is  no 
external  field  to  make  the  probabilities  differ  for  jumps  in  the  positive  or  negative  x  - 
directions.  The  average  distance  moved  in  the  x  direction  after  n  jumps  is  n  times 
<  ^cos^  > ,  which  also  vanishes.  On  the  other  hand,  the  average  square  of  the  distance 
moved  in  the  x-direction  is 

<  >=  =  n<A^  cos^  0  > 

h,k  h 

This  equation  is  valid  since  there  is  no  correlation  between  distance  moved  on  different 
jumps,  that  is,  (<  >=  0  for  h^k).  Furthermore,  if  there  is  no  correlation  between 

\  and  cos^ ,  which  is  assumed  to  be  the  case  in  this  derivation,  then 

<x^  >=  n<^  ><  cos^  6  > 

It  is  well  known  that  the  average  value  ofcos^^  is  1/3,  and  the  average  value  of  is 
itself  (the  average  of  a  constant  is  that  constant).  Thus  the  random  -  walk  model  leads 
to  the  result  that  the  average  square  distance  moved  by  a  particle  is  proportional  to  the 
number  of  steps  taken  and  to  the  square  of  the  average  step  length.  This  condition  on  the 

average  square  distance  is  valid  if  the  diffusion  coefficient  D  =  ^s?A  where  s  is  the 

number  of  steps  taken  per  unit  time.  If  we  combining  Pick’s  first  law,  that  the  flux  of 
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particles  is  equal  to  -D  multiplied  by  the  concentration  gradient,  with  the  equation  of 
continuity,  we  obtain  Pick’s  second  law  (the  diffusion  equation  in  one  dimension), 

dt  dx:\  dcJ 

Here,  p  =  p(x,t)  is  the  number  density  or  concentration  of  particles,  p{x,t)dx  being 
number  of  particles  between  x  and  x+dx  at  time  t.  It  is  normalized  according  to 

p+oo 

dxp{x,t)  =  N 

J  — 00 

where  N  is  the  total  number  of  particles.  Note  that  p  =  N  times  the  Probability  density 
function.  If  D  can  be  assumed  independent  of  x.  Pick’s  second  law  simplifies  to 

a  dx^ 

Normally,  D  depends  on  concentration,  and  diffusion  occurs  in  the  presence  of  a 
concentration  gradient,  so  D  will  depend  on  x.  If  the  concentration  is  low,  this  dependence 
may  be  unimportant.  In  solids  and  alloys,  involving  tracer  diffusion,  D  may  be  taken  to  be 
constant  since  the  tracer  particles  are  the  same  as  the  other  particles  except  for  labeling. 

In  preparing  for  the  development  of  PROFILER  //,  we  considered  the  solutions  to  Pick’s 
second  law,  which  depend  on  initial  conditions,  in  two  simple  geometrical  situations,  the 
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point  function  and  the  step  function.  If  p  =  NS(x)  at  t=0  (all  particles  are  at  x=0  to  start, 
the  point  function),  the  solution  is 


p  =  N{AnDt)  exp 

where  <  >=  2Dt .  On  the  other  hand  the  step  function  has  the  initial  conditions 

p  =  Po  for  X  <0  and p  =  0  forx  >  0  , 

(particles  are  on  one  side  of  the  finite  plane  at  x  =0).  The  solution  can  be  found  by 
considering  this  initial  density  profile  as  a  superposition  of  S  -function  profiles,  so  that  for 
t>0  one  can  just  superpose  density  profiles  of  the  form  as  given  above.  If  the  step- 
function  profile  is  written  as 


p(x,0)  =  Poj  6ix-y)dy 

»  — 00 

the  density  at  t>  0  is 


p(x,  0  =  A  f  {^nDt)  exp 

•'—00 


ADt 


dy 


On  substituting  z  =  x  -  y  as  the  variable  of  integration,  this  becomes 
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p{x,t)ip,={A7£>tr'^ 


1 


r 


ixn/Dt 


e-y  cfy= 


1  2  r«>  2 

0-7=1 

2  ixnJWt 


=  -^erM 


X 

iWt 


\-erf(^ 


X 

lyfDt 


) 


where  erf( 


X 

2y[Di 


)  is  the  error  function.  Mathematically,  this  function  is  defined  to  be  as 


2  /  4^  times  the  integral  from  0  to  of  6xp(-^^)  so  that  erf  (oo)  =  1.  The  above 

expression  for  p{x,t)/ p^  is  plotted  in  the  results  section,  for  -/D?  =  0.1, 0.5, 0.9, 1.3, 1.7 
where  the  appropriate  series  expansions  are  used  for  the  error  function.  These  procedures 
have  provided  a  beginning  for  the  extension  of  PROFILER  to  a  new  and  more  powerful 
computer  program,  PROFILER  II. 

Results 

A  successful  smoothing  operation  was  obtained  from  the  equation 


i-m 

f=-m _ 

N 
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where  m  is  the  number  of  terms  used  in  the  convolution,  and  C,  '5  are  the  terms  of  the 
convolution  from  the  method  of  least  squares.  This  procedure,  however,  did  not  remove 
the  difficulty  of  the  variable  interface  of  the  coupling  dififiisants.  Therefore,  we  initiated  the 
development  of  a  self-contained  computer  program  with  appropriate  capabilities  that  are 
lacking  in  PROFILER.  Specifically,  we  obtained  the  dififiision  equation  from  the  general 
principle  of  random  -  walk,  and  obtained  its  solution  in  two  important  situations:  the  point 


function  and  the  step  function. 


Step  Function  Diffusion 


Using  the  series  expansions  for  the 
error  function,  we  obtained  the  graphs  of 
concentration,  as  depicted  in  fig.  3  for 

five  cases  of  ^Dt .  These  graphs  show 
the  manners  in  which  diffusion  takes  place 


x-^oordinate 


-Seriesi 

-  Series2 
-Series3 
-Series4 

-  SeriesS 


in  the  step  function  situation.  Note  that  as 

Fig  3 .  Solution  to  diffusion  equation 
4^  gets  larger,  greater  movement  of  the  for  various  values  of  4^ ,  starting 

with  a  step  profile. 

atoms  occurs  for  given  values  of  x.  The  mathematical  representations  of  the  erf  function 
as  used  in  Fig.  3.  are: 


2  f  a'’  a"  ^ 
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1 


1-3 


valid  for  a>\ 


r 


erf{a)^\- 


+  • 


yf^aV  2a^  (2a^r 


1-3-5  1-3-5-7  l-3-5-7-9^ 

(2a^y  ^  (2a^y  ~  (2a')'  ^ 

the  asymptote  region 


Continuation  in  the  development  of  PROFILER  II  to  handle  variable  initial  interfaces, 
even  statistically  fluctuating  interfaces,  is  an  obtainable  and  desirable  goal. 

Conclusion 


PROFILER,  as  a  fixed-code  computer  program,  has  been  shown  to  be  a  powerful 
and  efficient  tool  for  modeling  the  diffusion  of  metallic  atoms  across  a  flat  initial  plane  of 
coupling  for  two  alloying  components.  It  produces  the  concentration  profiles  in  a  matter 
of  a  few  seconds  for  up  to  eight  difflisants  in  the  alloying  couple.  It  is  more  than  adequate 
for  such  situations.  However,  its  usage  for  variable  interfaces  is  inappropriate  and 
ineffective.  Also,  when  using  PROFILER,  data  can  be  convolutedly  averaged  before 
reduction  to  lessen  statistical  noise,  without  degrading  the  Auger  signal. 

Faced  with  the  dilemma  of  the  fixed-code  of  PROFILER,  we  initiated  the  process 
of  developing  PROFILER  II,  and  have  progressed  to  the  point  of  developing  the 
analytical  functions  for  the  point  source  and  the  step  functions.  These  features  form  the 
basis  for  the  actual  development  of  a  new  computer  program  for  the  trace  diffusion  of 
atoms  of  thin  films  on  substrates..  It  remains  a  desirable  task  to  complete  the  development 
of  such  a  computer  program. 
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Analysis  of  Stressed  Speech  using  Cepstral  Domain  Features 


K.  Gopalan 
Professor 

Department  of  Engineering 
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Abstract 


TTie  effect  of  workplace  stress  on  speech  parameters  was  studied  in  this  project.  Correlation  between  certain  cepstral 
coefficients  and  the  heart  rate  of  fighter  aircraft  flight  controllers  was  investigated.  It  was  found  that  as  the  heart  rate 
increased  due  to  stress,  mel  cepstral  coefficients  corresponding  to  pitch  range  of  frequencies  showed  increasing 
periodicities  in  their  correlated  values.  In  addition,  the  index  of  peak  coefficients  increased  with  increasing  heart 
rate  while  the  standard  deviation  of  cepstra  at  all  the  indices,  in  general,  showed  a  decreasing  trend  with  increasing 
heart  rate.  As  with  pitch  frequencies,  cepstral  indices  at  which  proportional  variation  with  heart  rate  occurred, 
depended  on  the  speaker  and  the  phoneme. 
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Analysis  of  Stressed  Speech  using  Cepstral  Domain  Features 

K.  Gopalan 
I.  Introduction 

Analysis  of  human  stress  based  on  the  speech  signal  output  of  a  speaker  is  important  in  many  applications. 
These  applications  include  automatic  monitoring  of  the  stress  levels  of  callers  for  emergency  services  so  as  to 
respond  with  appropriate  service  personnel,  and  analyzing  the  physiological  and  emotional  state  of  fighter  aircraft 
pilots  to  decide  on  their  suitability  to  proceed  with  their  assigned  task.  In  addition,  acoustic  correlates  of  stress  alter 
speech-  and  speaker-dependent  parameters;  this  alteration,  unless  compensated  for  in  the  feature  domain,  can  result 
in  poor  performance  of  speech  and  speaker  recognition  systems. 

This  report  presents  the  results  of  analyzing  certain  cepstral  domain  features  as  a  function  of  the  heart  rate 
of  fighter  aircraft  pilots. 


II.  Review  of  Speech  Features  Affected  by  Stress 

One  of  the  predominant  effects  of  stress  on  speech  signal  is  the  variation  in  the  fundamental  frequency,  FO. 
This  was  demonstrated  by  Lieberman  and  Michaels  [1]  by  presenting  to  a  group  of  listeners  only  the  pitch 
information  available  in  tapes  containing  speech  at  different  emotional  states.  Several  researchers  have  recently 
found  that  FO  variations  correlated  well  with  stress.  In  one  of  the  early  works  by  Williams  and  Stevens  [2],  the 
median  value  and  the  range  of  FO  were  found  to  rise  for  speech  uttered  in  anger  or  fear  from  those  corresponding  to 
normal  speech;  for  sorrow,  the  value  and  the  range  of  FO  were  reduced  from  their  neutral  values.  Using  cepstral 
analysis  of  pitch.  Levin  and  Lord  [3]  showed  that  an  increase  in  the  range  averaged  pitch  frequencies  indicated 
emotional  change  from  normal  state.  Streeter,  et  al  [4]  found  that  a  listener’s  rating  of  a  speaker’s  stress  level 
correlated  positively  with  average  pitch;  however,  based  on  their  pitch  and  amplitude  measurements,  they  concluded 
that  no  single  acoustic  pattern  resulted  from  increased  stress.  Using  the  Teager  energy  profile,  the  change  in  the 
nonlinear  component  of  speech  was  shown  by  Cairns  and  Hansen  [5]  to  distinguish  clearly  between  loud  and  angry 
speaking  styles.  While  the  modulation  pattern  of  the  first  formant,  as  quantified  by  the  Teager  energy  profile, 
changed  appreciably  between  loud  and  angry  speech,  clear  speech  could  not  be  easily  differentiated  from  neutral 
speech.  Based  on  listeners’  perception  of  stress  in  FO-altered  speech,  Protopapas  and  Lieberman  [6]  found  that  mean 
and  maximum  FO  within  an  utterance  correlated  significantly  with  higher  stress  ratings.  They  concluded  that  except 
for  extreme  stress  and  terror,  their  results  could  not  be  generalized.  Hansen  and  Womack  [7]  used  a  set  of  mel- 
distributed  cepstral  features  and  a  neural  network  to  classify  1 1  stress  conditions  including  angry  speech,  question, 
soft  speech  and  slow  speech.  They  showed  that  different  stress  conditions  and  phoneme  classes  affect  the  features 
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differently;  vowels^  in  particular,  are  significantly  affected  with  a  high  degree  of  separation  using  autocorrelation  of 
the  cepstral  parameters  for  the  different  stress  conditions. 

Another  area  related  to  the  analysis  of  stressed  speech  is  concerned  with  the  task  of  identifying  the 
acoustic-phonetic  differences  between  normal  and  abnormal  speech.  Identification  of  features  that  distinguish 
between  normal,  loud  and  Lombard  speech,  for  example,  is  important  in  the  development  of  robust  speech  and 
speaker  recognition  systems.  The  recognition  systems  are,  in  general,  trained  using  mostly  normal  speech  because  of 
(a)  the  difficulty  of  collecting  data  under  abnormal  conditions,  and  (b)  the  volume  of  such  data.  When  presented 
with  data  collected  in  a  stressful  and  noisy  environment,  however,  the  performance  of  these  systems  becomes 
degraded.  Rajasekaran,  et  al  [8]  report  an  order  of  magnitude  decrease  in  the  recognition  of  speech  from  a 
vocabulary  of  words  under  simulated  stress  and  noise  conditions.  Since  a  large  variation  in  spectral  tilt  is  commonly 
observed  for  speech  under  stress,  Chen  [9]  proposed  a  cepstral  domain  compensation  to  normally  trained  word 
models  for  reducing  the  recognition  error  rate.  Stressed  speech  was  quantified  by  smoothing  and  fitting  the  means  of 
certain  cepstral  components  to  an  exponential  function.  By  subtracting  the  smoothed  values  from  test  vectors, 
significantly  low  recognition  error  rates  were  achieved  for  a  data  base  of  simulated  stressed  speech.  Using  a  data 
base  of  normal,  loud  (simulated  high  stress)  and  Lombard  (pink  noise  injected)  speech,  Stanton,  et  al  [10]  observed 
energy  migrations  in  different  frequency  bands  and  spectral  tilt  between  normal  and  abnormal  speech.  By 
accounting  for  these  variations,  Stanton,  et  al  [1 1]  report  an  improvement  in  the  performance  in  the  recognition  of 
stressed  speech.  Bou-Ghazale  and  Hansen  [12]  took  the  approach  of  modeling  stressed  speech  as  a  sequence  of 
articulator  movements  whose  paths  deviated  from  those  for  normal  speech.  Using  the  deviations  to  train  and  test, 
they  report  an  improvement  in  the  recognition  of  keywords  in  simulated  stressed  speech  in  noise-free  conditions. 

From  the  brief  discussion  above,  it  is  clear  that  different  conditions  of  physiological  and  emotional  stress 
affect  speech  features  differently.  Although  the  fundamental  frequency  FO  is  generally  observed  to  vary  with  stress, 
no  single  stress  token  appears  to  correlate  all  phonemes  under  various  speaking  styles.  The  difficulty  of  obtaining 
actual  stressed  speech  data  for  training  speech  recognition  systems  has  led  to  simulated  data  that  were  restricted 
mostly  to  loud  and  Lombard  styles.  Statistical  variations  of  features  in  the  cepstral  domain,  in  general,  have  been 
found  to  represent  well  the  stress-induced  variations  for  classifying  simulated  stressed  speech. 

III.  Problem  and  Approach 

In  this  study,  the  problem  of  correlating  a  measured  stress  factor  with  a  cepstral  domain  feature  was 
addressed.  The  Speech  Under  Stress  database  used  for  this  study  consists  of  speech  from  European  (non-English 
native)  fighter  controllers  under  stress.  The  database  contains  speech  utterances  from  nine  male  controllers  that  were 
recorded  during  communication  with  fighters.  Various  stress  factors,  such  as  systolic  pressure,  diastolic  pressure, 
carbon  dioxide  and  heart  rate,  were  measured  during  the  speech.  For  the  present  study,  heart  rates  were  chosen  to 
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relate  with  speech  features  because  of  the  nonavailability  of  other  measurements  for  all  the  speech  files  available  in 
the  database.  Each  speech  file  was  encoded  in  16  bits  per  sample  at  a  rate  of  16000  samples  per  second. 

Since  the  database  has  continuous  speech  with  a  large  vocabulary  of  words,  it  was  necessary  to  choose  an 
utterance  that  occurred  fi-equently  and  at  varying  heart  rates  for  each  speaker.  Three  different  heart  rates  with  as 
large  a  variation  as  possible  among  them  were  chosen  for  each  speaker.  It  was  found  that  the  utterance  ""bulls  eye"' 
most  commonly  occurred  with  three  different  heart  rates  for  many  of  the  speakers. 

Using  Entropic  Waves+  package,  the  utterance  ""bulls  eye''  corresponding  to  each  of  three  different  heart 
rates  was  extracted  for  each  speaker.  Cepstral  domain  features  were  computed  for  the  segmented  utterances  and 
analyzed  with  the  known  heart  rates. 

IV.  Low  Quefrency  Cepstral  Features 

Inasmuch  as  the  fundamental  frequency  FO  is  commonly  observed  to  rise  in  stressed  speech,  cepstral  features 
corresponding  to  the  range  of  FO  were  studied  for  each  speaker.  Table  I  lists  the  range  of  FO  values  for  the  speakers 
considered^ 


Table  I 

Range  of  FO  for  the  utterance  ""bulls  eye" 


Speaker 

Minimum  FO, 
Hz 

Maximum  FO, 
Hz 

JO 

142 

261 

RD 

101 

273 

RK 

96 

180 

SK 

90 

180 

VL 

100 

155 

Based  on  the  above  range  of  frequencies  for  FO,  mel-cepstra  were  calculated  at  the  center  frequencies  of  70 
Hz  to  300  Hz  in  steps  of  10  Hz  with  a  fixed  bandwidth  of  approximately  19.5  Hz  each.  A  frame  length  of  15  ms 
with  an  overlap  of  10  ms  was  used  in  segmenting  each  utterance.  After  testing  with  Hanning,  triangular  and 
exponentially  decaying  windows,  the  final  analyis  was  carried  out  using  Hamming  windows.  With  4096-point 
discrete-Fourier  transform  (giving  a  frequency  resolution  of  3.9  Hz),  cepstra  at  24  points  (quefrencies)  over  the  pitch 
range  of  frequencies  were  calculated  at  cepstral  indices  from  18  to  77  in  steps  of  2  or  3.  A  triangular  bandwidth  of  5 
points  (»19.5  Hz)  was  used  at  each  index.  In  addition,  another  set  of  cepstra  at  30  points  corresponding  to  formant 
range  of  frequencies  (at  approximately  250  Hz  to  5000  Hz)  were  calculated  as  discussed  in  Section  V.  The  features 


^  The  pitch  frequencies  were  obtained  using  Waves+  software  package  from  Entropic  [15]  with  the  same  frame 
length  of  15  ms  and  overlap  of  10  ms  as  for  the  cesptral  calculations. 
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formed  from  the  54  cepstral  coefficients  for  each  speaker  under  three  different  heart  rates  were  studied  for 
correlation  with  the  heart  rates. 

Fig.  1  shows  the  distribution  of  cepstral  coefficients  with  frames  for  speaker  VL.  As  given  in  Table  11,  the 
pitch  frequencies  for  the  low,  medium  and  high  heart  rates  for  this  speaker  lie  within  the  range  of  100  Hz  to  155  Hz. 
This  range  is  the  smallest  for  all  the  speakers  considered.  Because  of  this  low  range  of  fundamental  frequency 
variation,  only  cepstra  at  indices  26,  28,  31,  33,  36,  38  and  41  (corresponding  to  frequencies  101.6  Hz,  109.4  Hz, 
121.1  Hz,  128.9  Hz,  140.6  Hz,  148.4  Hz,  and  160.2  Hz,  or  straight  indices  4  through  10)  are  shown  in  Fig.  1.  These 
cepstra  were  scaled  by  the  total  of  the  logged  spectrum  over  the  entire  utterance,  %ulVs  eye"  and  normalized  to  zero 
mean  and  unity  variance. 

For  comparison,  the  scaled  mean  of  each  of  the  54  cepstral  values  across  all  the  frames,  the  total  logged 
spectrum  and  the  standard  deviation  of  the  scaled  cepstra  are  shown  for  the  three  utterances  of  the  speaker  VL  in  Fig. 
2. 

Table  U 

Range  of  FO  at  Measured  Heart  rates  for  speaker  VL 


Index 

Pitch  Range, 
Hz 

Heart  Rate 

Norm. 
Heart  Rate 

VL065G 

100  -  152 

103.3 

1.490 

VL045G 

105-117 

100.3 

1.019 

VL093G 

110-140 

97.45 

0.673 

It  must  be  noted  that  the  average  value  of  each  of  the  54  mel  cepstral  coefficients  across  all  the  frames  for  a  speaker 
is  almost  constant  regardless  of  the  heart  rate.  Fig.  3  shows  the  mean  and  the  standard  deviation  of  the  cepstral 
coefficients  in  the  expanded  scale  for  the  pitch  range  of  frequencies  for  speaker  VL.  For  comparison.  Fig.  4  shows 
the  mean  and  the  standard  deviation  of  the  cepstral  coefficients  for  speaker  RD  at  three  different  heart  rates.  The 
heart  rates  used  and  the  pitch  frequencies  at  these  rates  for  RD  are  given  in  Table  III.  Note  that  RD  has  the  widest 
range  of  pitch  frequency  variation;  the  range  of  heart  rate,  however,  is  one  of  the  smallest.  The  mean  values  of  each 
of  the  cepstral  coefficients  for  this  speaker,  however,  show  an  increase  with  the  heart  rate.  From  these  two  cepstral 
distributions  (and  from  the  others  not  shown,  but  calculated),  it  appears  that  the  mean  of  the  scaled  cepstra  is  not  a 
good  indicator  of  stress.  It  may  be  inferred  that  the  logged  spectral  energy,  while  increasing  in  frames  under  stressed 
conditions,  may  be  decreasing  in  other  frames.  Depending  on  the  phonemes  (and  hence  the  number  of  frames 
involved),  therefore,  the  overall  mean  of  each  cepstral  value  across  an  entire  utterance  may  vary  nonuniformly  with 
heart  rate. 
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p«ch  rang#,  VL06SQ>BE.  pitch  Indax  (4  lo  6)  _ Norm,  mcc  -  pitch  langt.  VLOeSQ-BE,  pitch  indax  (7  to  10) 


1 


Speaker  VL.  (a)  and  (b)  for  high  heart  rate  (VL065G) 


Norm  mcc  pitch  rtngo,  Vt045Q-BE.  pHch  Indtx  (7  lo  10)  -  33.36  38, 41 


-  pitch  ring*.  VL093Q-BE  .  pitch  IntUK  (« to  6)  , _  Norm,  mec  -  pitch  rtngp.  VL093Q-BE  ,pilchlnd«x(7  to  10) 


pitch  (100  Hz  -  160  Hz)  range  for  Speaker  VL.  (e)  and  (0  for  low  heart  rate  (VL093G) 


Fig.  3  (i)  Mean,  and  (ii)  standard  deviation  of  cepstral  coefficients  for  speaker  VL  at  high  (-),  medium  and  low 
(V)  heart  rates 

Table  ffl 

Range  of  FO  at  Measured  Heart  rates  for  speaker  RD 


Index 

Pitch  Range, 

Hz 

Heart  Rate 

Norm. 
Heart  Rate 

RD055G 

159  -  273 

77.29 

0.215 

RD021G 

101  -  181 

76.49 

0.108 

RD041G 

122-199 

72.27 

-0.454 
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Thai  the  logged  spectral  energy  varies  distinctly  based  on  the  heart  rale  is  seen  in  Fig.  2  (middle  plot;, 
which  shows  a  small  valley  at  the  beginning  of  “eye”  in  **buirs  eye**.  The  valley  becomes  more  prominent  as  the 
heart  rate  increases,  although  the  peak  logged  spectral  energy  is  approximately  the  same  in  all  three  cases.  The  same 
type  of  behavior  in  the  logged  spectral  energy  is  seen  for  speaker  RD  as  shown  in  Fig.  5.  For  this  speaker,  however, 


Fig.  4  (i)  Mean,  and  (ii)  standard  deviation  of  cepslral  coefficients  for  Speaker  RD  at  high  (-),  medium  (..),  and  low 
(V)  heart  rates 
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the  variation  betv^een  medium  (RD021G)  and  low  (RD041G)  is  not  as  distinct  as  between  high  (RD055G)  and  the 
other  two.  This  may  be  attributed  to  the  fact  that  the  low  heart  rate  considered  may  be  below  the  “normal”  rate,  as 
seen  from  the  normalized  rate  of -0.454;  hence  the  changes  in  the  logged  spectral  energy  for  the  lower-than-normal 
heart  rate  differs  from  that  for  the  higher-than-normal  heart  rates.  In  addition,  unlike  for  speaker  VL,  the  peak 
energy  appears  to  be  in  direct  proportion  to  the  heart  rate.  Based  on  the  observations  for  these  two  speakers, 
therefore,  the  variations  in  the  total  spectral  energy  with  heart  rate  may  be  hypothesized  to  occur  at  different  frames 
and  phonemes  for  different  speakers. 


Fig.  5  Log  spectral  energy  for  speaker  RD  at  (i)  high,  (ii)  medium,  and  (iii)  low  heart  rates 
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The  standard  deviations  of  the  scaled  cepstral  coefficients  at  iow-to-mid  indices  (corresponding  to 
frequencies  of  60  Hz  to  180  Hz),  in  general,  appear  to  be  the  largest  at  low  heart  rate.  This  has  been  observed  for  all 
the  speakers  listed  in  Table  I.  The  lowest  deviations  at  many  of  the  cepstral  indices  (frequencies),  however,  resulted 
at  medium  heart  rate  instead  of  at  high  heart  rate,  as  seen  in  Figs,  3  and  4.  This  anomaly  for  the  smallest  deviations 
in  the  cepstral  values  in  the  pitch  range  of  indices  was  seen  for  three  of  the  five  speakers  considered.  As  for  the 
common  behavior  for  all  the  five  speakers,  the  largest  variation  in  the  standard  deviation  occurred  in  the  range  of 
approximately  120  Hz  to  170  Hz  (corresponding  to  cepstral  index  31  to  44),  regardless  of  the  nominal  pitch 
frequency  and  its  variation  with  heart  rate.  This  is  a  noteworthy  feature  since  the  pitch  of  the  speakers  in  the 
utterances  considered  (Table  I)  varied  by  a  minimum  of  55  Hz  (for  VL)  to  a  maximum  of  172  Hz  (for  RD). 

The  distribution  of  the  cepstral  coefficients  with  time  (frames)  does  not  appear  to  have  the  same  variations 
with  heart  rate  for  different  speakers.  The  cepstrum  at  index  4  (C4,  or  frequency  100  Hz)  for  speaker  VL,  for 
example,  has  less  fluctuations  with  decreased  heart  rate.  The  same  type  of  smoothing  effect  with  decreased  heart  rate 
is  observed  in  Fig.  1  at  other  indices  in  the  pitch  range  as  well.  Although  not  as  smooth,  similar  variations  occurred 
for  speaker  RD  in  the  entire  pitch  range  of  frequencies.  For  other  speakers,  however,  the  fluctuations  in  the  cepstral 
values  increased  as  the  heart  rate  decreased. 

To  get  a  better  relationship  between  heart  rate  and  the  cepstral  values,  correlation  of  the  normalized  cepstral 
coefficients  was  carried  out.  Hidden  periodicities  in  the  coefficients  were  brought  out  in  the  autocorrelation  of  the 
coefficients  as  seen  in  Fig.  6  for  speaker  VL.  At  index  4  (corresponding  to  100  Hz),  for  example,  the  autocorrelation 
of  C4  at  moderate  heart  rate  shows  a  distinct  peak  at  approximately  frame  30  and  again  at  frame  60  .  This  indicates 
a  period  of  approximately  30  frames,  orl60  ms.  At  heart  rates  above  and  below,  however,  the  peaks  are  either  not  as 
distinct  or  are  at  different  locations  for  C4.  Similar  differences  in  the  periodic  variation  of  the  cepstral  values  are 
observed  at  other  indices  as  well.  Correlation  of  the  cepstral  coefficient  at  index  6,  which  corresponds  to  120  Hz, 
shows  a  period  of  approximately  160  ms  (30  frames)  at  high  heart  rate  (VL065G),  and  135  ms  (24  frames)  at 
medium  heart  rate  (VL045G);  at  low  heart  rate  (VL093G),  the  period  appears  to  be  about  85  ms  (15  frames),  based 
on  the  clear  peaks  after  frame  index  40  in  Fig.  6  (c). 

For  speaker  RD,  the  autocorrelation  of  cepstral  coefficients,  indicates  different  variations  with  heart  rate 
depending  on  the  choice  of  the  cepstral  index.  For  index  2  (80  Hz),  for  example,  the  correlation  has  a  period  of  1 10 
ms  (20  frames)  at  high  heart  rate  (RD055G),  160  ms  (30  frames)  at  moderately  high  heart  rate  (RD021G),  and  1 10 
ms  (20  frames)  again  at  low  heart  rate  (RD041G).  This  is  inconsistent  with  the  behavior  observed  for  speaker  VL. 

At  index  5  (1 10  Hz),  however,  the  periodicities  decrease  with  decreasing  heart  rate:  260  ms  (50  frames)  at  high 
(RD055G),  235  ms  (45  frames)  at  medium  (RD021G),  and  85  ms  (15  frames)  at  low  (RD041G).  As  with  speaker 
VL,  these  periods,  in  general,  are  consistent  with  decreasing  heart  rate  at  the  cepstral  indices  considered.  Although 
not  shown,  the  same  type  of  periodic  variation  of  the  autocorrelation  of  cepstral  coefficients  was  observed  for  all  the 
speakers  studied.  We  note  that  even  allowing  for  variable  lengths  in  the  utterances  due  to  stress,  the  decrease  in  the 
autocorrelation  period  is  consistent  with  decreasing  heart  rate. 
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(a)  high  heart  rate  (VL065G),  (b)  medium  heart  rate  (VL045G),  and  (c)  low  heart  rate  (VL093G) 


For  further  evidence  of  the  variation  of  the  cepstral  values  with  heart  rate,  cross  correlation  of  the  values  at 
a  few  indices  was  performed.  Fig.  7  shows  the  cross  correlation  of  the  cepstra  at  three  frequencies  for  speaker  VL. 

In  Fig.  7a,  the  cross  correlation  between  C4  (100  Hz)  and  C9  (150  Hz)  for  speaker  VL  shows  four  peaks  at  high 
heart  rate,  one  major  peak  at  moderate  heart  rate,  and  two  major  peaks  at  low  rate.  The  common  peak  at  frame  index 
30-35  for  high  and  medium  heart  rates  indicates  that  the  cepstra  at  100  Hz  and  150  Hz  are  highly  correlated  around 
the  first  third  of  the  utterance;  at  low  heart  rate,  however,  C4  and  C9  seem  to  be  negatively  correlated.  At 
frequencies  150  Hz  (C9)  and  200  Hz  (C14)  (Fig.  7b),  the  common  peak  between  high  and  medium  heart  rates  occur 
in  the  middle  of  the  utterance  while  for  low  heart  rate,  no  well-defined  peak  appears.  Considering  cepstra  at  100  Hz 
(C4)  and  200  Hz  (C14)  (Fig.  7c),  a  high  degree  of  correlation  appears  at  the  beginning  (around  frame  10)  and  again 
in  the  middle  of  the  utterance  for  high  and  medium  heart  rates.  At  low  heart  rate,  only  the  middle  of  the  utterance 
shows  a  large  degree  of  correlation  between  100  Hz  and  200  Hz,  Based  on  the  three  frequencies  considered,  it 
appears  that,  depending  on  the  heart  rate,  a  choice  of  cepstral  indices  may  result  in  a  better  correlation  of  cepstra  at 
different  frequencies. 

For  speaker  RD,  cross  correlation  of  the  cepstral  values  at  low  frequencies  (between  100  Hz  and  150  Hz,) 
showed  almost  identical  variations,  indicating  that  the  cepstra  C4  and  C9  vary  similarly  at  all  heart  rates.  Although 
similar  identical  variations  were  seen  for  150  Hz  and  200  Hz,  a  much  stronger  positive  correlation  appeared  at  mid 
point  of  the  utterance  at  medium  heart  rate  compared  to  those  at  high  and  low  heart  rates.  Between  100  Hz  and  200 
Hz,  the  cross  correlation  did  not  indicate  a  very  distinguishing  feature  among  the  three  heart  rates  considered.  As 
with  speaker  VL,  therefore,  the  choice  of  cepstral  indices  appears  to  govern  the  degree  of  correlation  among  cepstra 
at  different  heart  rates.  The  same  general  trend  in  the  cross  correlation  of  cepstral  values  at  the  three  frequencies 
used  was  observed  in  the  other  speakers  considered. 


V.  Cepstral  Features  in  the  Formant  Range  of  Frequencies 

To  study  the  acoustical  attributes  of  stress  in  the  formant  reinge  of  frequencies,  cepstral  values  in  the  high 
frequency  range  were  computed  for  the  utterance,  *'buirs  eye**  spoken  by  the  fighter  controllers  at  various  heart  rates. 
Because  of  the  changing  formants  and  their  bandwidths  due  to  stress,  a  range  of  frequencies  from  approximately  250 
Hz  to  4932  Hz,  with  increasing  bandwidths  at  each  frequency,  were  chosen  for  cepstral  computation.  Table  IV  lists 
the  approximate  frequencies  and  the  triangular  bandwidths  used. 
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Table  IV 

Frequencies  and  Bandwidths  used  for  High-Frequency  Cepstra 


Index 

Frequency,  Hz 
Bandwidth,  Hz 

Index 

Frequency,  Hz 
Bandwidth,  Hz 

Index 

Frequency,  Hz 
Bandwidth,  Hz 


1 

2 

250 

281.5 

30 

33 

11 

12 

752 

833.7 

77.8 

85.6 

21 

22 

2054.2 

2266.1 

201.8 

222 

3 

4 

316.1 

354.3 

36.3 

39.9 

13 

14 

923.6 

1022.5 

94.2 

103.6 

23 

24 

2499.2 

2755.6 

244.2 

268.6 

5 

6 

396.2 

442.3 

43.9 

48.3 

15 

16 

1131.2 

1250.8 

113.9 

125.3 

25 

26 

3037.7 

3347.9 

295.5 

325 

7 

8 

493 

548.8 

53.1 

58.5 

17 

18 

1382.4 

1527.2 

137.8 

151.6 

27 

28 

3689.2 

4064.6 

357.5 

393.3 

9 

10 

610.2 

677.8 

64.3 

70.7 

19 

20 

1686.4 

1861.5 

166.8 

183.5 

29 

30 

4477.6 

4931.9 

432.6 

475.9 

As  with  the  low  quefrency  cepstra,  the  mean  cepstral  values  (Figs.  8a  and  9a)  at  the  higher  end  of  quefrency 
are  almost  invariant  with  high,  medium  and  low  heart  rates.  The  standard  deviations  (Fig.  8b  and  9b)  also  show 
behavior  similar  to  those  at  low  quefrencies:  cepstra  at  low  heart  rates  have,  in  general,  higher  deviations  than  those 
at  higher  heart  rates.  Unlike  the  cepstra  in  the  pitch  range,  the  deviations  at  low  heart  rate  show  an  increase  over  the 
high  and  medium  heart  rates  at  almost  all  frequencies  considered.  However,  the  difference  in  the  deviations  between 
medium  and  high  heart  rates  is  not  significant  or  consistent  across  the  cepstral  indices.  Depending  on  the  speaker 
and  the  heart  rate  difference,  the  difference  in  the  deviations  also  vary.  This  type  of  behavior  with  high  standard 
deviations  at  low  heart  rales  was  observed  for  all  the  speakers  considered. 


Of  the  30  indices  of  cepstral  values,  four  indices  at  4,  14,  18  and  23  were  chosen  for  correlation.  These 
indices  correspond  to  354.3  Hz,  1022.5  Hz,  1527.2  Hz,  and  2499.2  Hz  with  bandwidths  of  39.9  Hz,  103.6  Hz,  151.6 
Hz,  and  244.2  Hz  respectively.  We  note  that  these  frequencies,  except  for  1022.5  Hz,  are  approximately  in  the  mid 
range  of  the  first  three  formants.  The  cepstral  values  at  these  frequencies  do  not  seem  to  indicate  any  discernible 
pattern  of  behavior  with  heart  rates. 

Autocorrelation  of  the  cepstra  at  the  selected  indices,  as  with  that  at  low  frequencies,  shows  a  periodic 
behavior.  Fig.  10  depicts  the  autocorrelation  at  the  four  frequencies  for  speaker  VL.  The  period  in  the 
autocorrelation  at  354.3  Hz  shows  an  increase  with  decreasing  heart  rate:  28  frames  at  high,  30  frames  at  medium 
and  35  frames  at  low.  Similar  increase  in  the  period  can  be  seen  at  2499.2  Hz.  This  increase  is  in  contrast  with  the 
decreasing  period  in  the  autocorrelation  of  cepstra  at  low  frequencies  (Fig.  6).  At  the  two  other  chosen  frequencies, 
a  well-defined  periodic  variation  is  not  observed. 
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Cdpstral  Frequency,  Hz 


Figs.  8(a)  Mean  and  (b)  standard  deviation  of  cepstral  values  at  the  high  end  of  frequencies  at  high  (-),  medium 
and  low  (V)  heart  rates  for  speaker  VL 

The  above  behavior  was  also  observed  in  the  autocorrelation  of  the  cepstra  for  speaker  RD.  For  this 
speaker,  however,  the  low  heart  rate  does  not  yield  a  proportionally  longer  periods  at  the  chosen  cepstral  indices. 
This  again  demonstrates  that  the  stress-related  changes  in  the  cepstra  occur  at  different  indices  for  different  speakers. 
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Figs.  9  (a)  Mean  and  (b)  standard  deviation  of  cepstral  values  at  the  high  end  of  frequencies  at  high  (-),  medium 
and  low  (V)  heart  rates  for  speaker  RD 


Cross  correlation  of  the  cepstra  at  the  selected  indices  (Fig.  1 1)  shows  a  smoothing  trend  with  fewer  peaks 
as  the  heart  rate  decreases  from  high  to  medium  to  low.  This  smoothing  behavior  was  consistently  seen  at  other 
frequencies  as  well.  In  addition,  the  same  trend  was  observed  for  each  of  the  speakers  considered. 
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Autocorrelation  of  cepstral  values  at  (i)  354.3  Hz,  (ii)  1022.5  Hz,(iii)  1527.2  Hz,  and  (iv)  2499.2  Hz  for  speaker  VL 
I  heart  rate  (VL065G),  (b)  medium  heart  rate  (VL045G),  and  (c)  low  heart  rate  (VL093G) 


Fig.  1 1  Cross  correlation  of  cepstral  values  at  selected  indices:  (i)  354.3  Hz  and  1022.5  Hz,  (ii)  354.3  Hz  and  1527.2  Hz,  and  (iii)  354.3  Hz  and  2499.2  Hz  for 
speaker  VL 

(a)  high  heart  rale  (VL065G),  (b)  medium  heart  rate  (VL045G),  and  (c)  low  heart  rale  (VL093G) 


VI.  Discussion 


The  present  work  analyzed  features  based  on  the  gross  variations  of  mel-distributed  cepstra  at  selected 
indices  to  determine  any  correlation  with  a  speaker’s  heart  rate.  From  the  different  parameters  considered  using  low- 
and  high-frequency  cepstra,  it  appears  that  the  cepstral  values  themselves  do  not  seem  to  indicate  a  direct 
relationship  to  the  speaker’s  heart  rate,  at  least  not  at  the  indices  chosen.  The  mean  cepstral  value  at  each  index 
across  an  utterance  is  virtually  constant  for  a  speaker  regardless  of  his  heart  rate.  While  this  feature  is  not  useful  in 
analyzing  stress  based  on  heart  rate,  it  can  be  applied  as  a  robust  feature  in  speaker  and  speech  recognition 
applications. 

The  standard  deviation  of  the  cepstral  values,  on  the  other  hand,  varies  inversely  with  heart  rate.  Regardless 
of  the  pitch  variation  for  a  speaker,  the  largest  variation  of  the  standard  deviation  was  observed  to  occur  in  a  narrow 
range  of  quefrency.  Similar  to  pitch,  therefore,  the  standard  deviation  of  cepstral  values  in  the  range  of  120  Hz  to 
170  Hz  can  be  used  to  correlate  with  heart  rate.  At  the  higher  quefrency  scale  (corresponding  to  above  1500  Hz),  in 
general,  the  standard  deviations  appear  to  be  much  larger  than  those  at  lower  quefrencies  (from  250  Hz  to  1500  Hz), 
Based  on  the  two  sets  of  deviations,  therefore,  cepstral  deviations  in  a  narrow  (pitch)  range  of  frequencies  are 
relatively  more  efficient  than  those  in  the  formant  range  in  comparing  relative  heart  rates.  Pitch-synchronous  analysis 
of  cepstra  at  selected  phonemes  is  more  likely  to  reflect  the  changes  in  the  pitch  due  to  heart  rate  changes.  More  data 
at  higher  indices  and  lower  bandwidths  may  be  needed  to  conclusively  verify  the  comparative  levels  of  heart  rate 
using  formant  range  of  frequencies. 

Although  results  of  using  Hamming  window  were  reported  here,  other  windows,  in  particular,  exponential 
[13,  14]  and  triangular  windows  were  also  used  in  the  early  part  of  this  research.  Exponential  window  was  applied  to 
the  cepstral  analysis  to  mimic  the  auditory  processing  [13]  so  that  stress  detection  may  be  reflected  in  the  observed 
cepstra.  Results  using  a  15  ms  (240  sample)  exponentially  rising  window  with  a  time-constant  of  4  ms  did  not  show 
any  improvement  over  those  using  Hamming  window.  Figs.  12  and  13,  for  example,  show  the  mean  and  the  standard 
deviations  of  cepstra  in  the  pitch  and  formant  range  for  speaker  VL. 

Peak  log  spectral  energy  is  seen  increasing  with  heart  rate.  The  location  of  the  peak,  however,  is  different 
for  different  speakers.  Based  on  a  number  of  speakers  and  different  phonemes,  the  peak  energy  feature  may  be 
calibrated  for  proportionality  with  heart  rate. 

Correlation  of  the  cepstra  at  the  selected  indices,  however,  show  the  following  trends  with  high,  medium 
and  low  heart  rates.  At  low  cepstral  indices,  autocorrelation  of  cepstra  for  a  speaker  show  peaks  occurring  at 
different  intervals  dependent  on  the  relative  heart  rates.  Autocorrelation  of  cepstra  at  low  quefrencies  have  peak 
intervals  decreasing  with  decreased  heart  rates.  The  extent  of  decrease  in  interval,  however,  depends  on  the  choice 
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of  the  cepstral  index  for  a  given  speaker.  This  is  akin  to  the  dependence  of  energy  in  a  spectral  band  on  the  stress:  if 
the  band  is  centered  at  nominal  pitch  frequency,  the  energy  variation  corresponds  to  the  shift  of  pitch  due  to  stress. 
As  with  cepstra,  therefore,  a  speaker’s  nominal  pitch  range  may  be  helpful  in  narrowing  the  range  for  spectral  shift 
determination.  At  the  selected  higher  end  of  indices,  the  peak  intervals  show  an  increase  with  decreasing  heart  rate. 
Here  again,  the  behavior  may  be  different  with  different  indices.  Also,  based  on  the  bandwidths  used  at  each  index 
and  depending  on  the  location  and  bandwidth  of  formants,  the  change  in  cepstra  with  heart  rate,  and  hence,  the 
autocorrelation  could  be  different  from  that  observed  here. 

Cross  correlation  of  cepstra  at  selected  frequencies,  such  as  the  formants  at  normal  heart  rates  for  a  speaker, 
is  more  likely  to  yield  large  variations  at  elevated  heart  rates.  These  variations  may  also  reflect  in  the  cepstral  values 
and  their  standard  deviations  at  the  selected  indices.  The  choice  of  center  frequencies  and  bandwidths  for  the  cepstra 
must  accommodate  possible  shifts  in  both  due  to  stress.  With  data  for  the  resting  heart  rate  of  a  speaker  and 
corresponding  utterance,  correlation  and  standard  deviation  of  cepstra  can  yield  values  proportional  to  the  spectral 
shifts. 


VIL  Conclusion 

In  this  project,  variation  of  a  set  of  cepstral  parameters  for  speech  spoken  at  different  heart  rates  was 
studied.  For  comparison,  the  utterance  for  "'bulVs  eye"  spoken  by  male  European  flight  controllers  was  considered  in 
each  case.  It  was  found  that  the  choice  of  cepstral  indices  determined  the  degree  of  variation  of  the  cepstra  and  their 
correlated  values  with  a  speaker’s  heart  rate.  As  with  pitch,  these  indices  varied  for  each  speaker.  If  the  cepstral 
indices  are  in  the  vicinity  of  nominal  pitch  frequencies,  the  cepstra,  their  correlation,  and  the  standard  deviations  all 
showed  proportional  variation  with  heart  rate.  Cepstra  at  higher  quefrencies  also  resulted  in  variations  proportional 
to  heart  rate  when  the  indices  and  their  bandwidths  corresponded  to  nominal  (unstressed)  values.  Mean  value  across 
an  utterance  for  each  speaker  at  each  cepstral  index  showed  an  almost  constant  value  independent  of  heart  rate. 
Hence,  the  mean  cepstral  value  at  one  or  more  indices  could  be  used  as  a  robust  feature  in  speech  and  speaker 
recognition  applications. 

Further  work  using  pitch-synchronous  cepstral  analysis  at  speaker-dependent  indices  is  expected  to  yield  a 
more  measurable  correlation  of  cepstral  features  with  heart  rate.  Once  this  correlation  is  established,  deviation  of  the 
features  for  different  phonemes  may  be  studied  at  different  heart  rates.  This  study  can  further  lead  to  the  analysis  of 
a  speaker’s  stress-induced  heart  rate  variation  using  speech-independent  cepstral  features. 
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Abstract 

This  paper  investigates  the  model  order  selection  problem  for  use  with  the  multichannel  autoregres¬ 
sive  (MAR)  process  in  airborne  radar  detection  processing  which  uses  an  Innovations  Based  Detection 
Architecture  (IBDA).  Results  indicate  that  a  low  order  model  should  be  used.  Specifically,  this  paper 
investigates  the  use  of  the  Akaike  Information  Criterion  (AIC)  and  prediction  error  power  over  indepen¬ 
dent  realizations  for  model  order  selection.  Examples  are  included  for  physically  modeled  data  sets  as 
well  as  actual  radar  data  sets. 
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1  Introduction 


It  has  been  established  that  parametric  modeling  for  target  detection  is  an  alternative  method  to  correlation- 
based  detection  methods  [1].  Parametric  modeling  using  autoregressive  processes  has  been  used  extensively 
in  the  literature  for  various  applications  in  the  univariate  case  as  well  as  in  the  multivariate  case  [2]  [3] 
[4]  [5].  However,  literature  discussing  the  application  of  multichannel  autoregressive  processes  is  relatively 
limited  for  the  airborne  radar  target  detection  application  [6].  Noticeably  absent  in  the  literature  is  guidance 
and  recommendations  for  model-order  selection  for  this  application.  This  research  investigates  the  model 
order  selection  and  applicability  of  Multichannel  Autoregressive  (MAR)  processes  to  the  airborne  radar 
surveillance  target  detection. 

The  use  of  the  MAR  structure  to  model  the  processes  associated  with  the  airborne  radar  surveillance 
scenario  lies  in  the  fact  that  this  type  of  signals  can  be  interpreted  as  a  complex  vector  time-series  signals  and 
that  the  clutter  can  be  represented  as  the  output  of  a  vector  AR  system.  Initial  results  of  such  observations, 
presented  herein  as  well  as  in  [7]  and  [8],  and  show  that  the  order  demanded  for  such  an  application  is 
relatively  low.  The  work  herein  is  in  agreement  with  results  showing  the  success  of  using  relatively  low-order 
models  for  target  detection  [9].  This  is  a  critical  point  for  airborne  surveillance  radar  applications  since 
detection  algorithms  must  be  feasible  to  implement  in  real-time  and  with  limited  computational  resources. 

In  general,  the  J-channel  multichannel  autoregressive  modeling  consists  in  estimating  a  set  of  MAR 
parameters  (complex  matrices)  A{k)  e  ,  I  <  k  <  p,k,p  e  N,  and  the  driving  noise  covariance  matrix 
E  €  ,  which  in  some  sense,  gives  the  minimum  prediction  error  and  yet  satisfy  the  principle  of  parsimony. 

A  MAR  process  can  be  realized  as  in  Figure  1,  where  U{n)  is  the  driving  noise  vector  with  correlation  matrix 
Ruu{k)^  The  complex  matrices  A{k)  represent  the  feedback  coefficients  for  the  vector  MAR  process  X{n). 


Figure  1:  MAR  model 

An  important  aspect  of  using  MAR  for  radar  detection  is  the  determination  of  the  model  order  (p) 
.  For  the  most  part,  the  selection  of  the  model  order  dictates  how  well  the  model  will  follow  the  true 
spectrum.  A  model  order  that  is  too  high  will  cause  spurious  peaks  in  the  frequency  response  as  well  as  a 
high  parameter  variance.  A  too  low  model  order  will  cause  unwanted  smoothing  of  the  exhibiting  spectrum 
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[2].  Both  cases  (too  high  or  too  low)  will  adversely  affect  detection  performance.  An  appropriate  model  order 
selection  criterion  has  been  a  challenging  formulation  for  MAR  processes.  However,  several  researchers  have 
introduced  reasonable  criteria  for  model  order  use  for  specific  scenarios.  One  of  these  criteria  is  the  {AlCf) 
Akaike  Information  Criteria,  also  known  as  An  Information  Criteria  [10].  This  research  explores  the  use  of 
the  AIC  and  the  prediction  error  power  (PEP)  for  MAR  model  order  selection  in  the  airborne  surveillance 
radar  scenario. 

Section  2  discusses  in  more  detail  the  airborne  surveillance  radar  detection  problem.  Section  3  presents 
the  MAR  theory.  Section  4  explains  the  AIC  and  PEP  for  the  multivariate  case.  Section  5  presents  numerical 
results  using  synthetic  and  actual  radar  data. 

2  Radar  Target  Detection 

In  general,  the  airborne  surveillance  radar  target  detection  consists  in  determining  whether  a  target  is  present 
or  not  in  a  large  number  of  range  cells.  Additionally,  if  a  target  is  present,  estimating  its  speed  and  range  is 
desired.  This  problem  has  been  studied  extensively  since  the  conception  of  radar  technology;  however,  new 
methods  continue  to  appear  in  the  literature.  A  driving  force  behind  continued  research  in  this  problem  is 
that  there  is  always  the  need  to  detect  the  target  faster,  cheaper  (economically  and/or  computationally)  and 
more  accurately  as  well  as  robustly. 

A  received  radar  return  at  the  J  element  array  for  range  cell  m  can  be  represented  in  matrix  form  as, 

■xi(l)  xi{2)  •••  xi(7V)- 

X2(l)  X2(2)  X2{N) 

X(m)=  .  :  :  (1) 

.xj(l)  xj{2)  ■■■  xj{N). 

where  N  is  the  number  of  pulses  in  a  coherent  processing  interval  (CPI)  .  In  general,  each  signal  component, 
Xi(n),  at  the  ith  array  element  and  at  time  n  is  equal  to 

Xi(n)  =  s(n)  +  j(n)  +  c(n)  +  u;(n)  (2) 

where  s{n)  is  the  target  return  reflection,  j{n)  is  a  possible  jammer,  c(n)  is  clutter  reflection  ,  and  w{n)  is 
a  white  noise  component. 

The  correlation  matrix  Rxx{k)  of  signal  x  is  the  J  x  J  matrix 

rn{k)  ri2{k)  ru(/c) 

r2i{k)  r22{k)  •••  r2j{k) 

rji{k)  rj2{k)  ■■■  rjj{k) 


where  each  element  (/c)  =  E{xi(n)*Xj(n  +  k)}  is  the  correlation  between  array  element  i  and  j  at  lag  k. 


In  addition  to  the  classical  correlation-based  detection  [11],  approaches  that  try  to  solve  this  problem 
are  the  Innovation  Ba^ed  Detection  Architectures  (IBDA)  ,  Adaptive  schemes,  and  Parametric  Adaptive 
Matched  Filter  (PAMF).  The  Innovation  Based  Detection  Architectures  were  first  presented  in  [12].  The 
experimental  analysis  of  IBDA  for  surveillance  radar  was  presented  in  [13].  Later  and  more  importantly 
in  relation  to  the  MAR  case  is  the  Model-Based  Detection  method  introduced  by  Zhang  and  Haykin  [1]. 
Wherein  they  develop  a  method  of  target  detection  using  the  power  spectrum  relation  that  exists  between 
the  autoregressive  coefficients  and  its  frequency  transform  as  well  as  exploring  its  use  in  target  detection  in 
a  multichannel  scenario.  The  difference  between  the  MAR  method  and  their  work  lies  mainly  in  that  their 
method  deals  with  a  stationary  radar  antenna  as  oppose  to  an  airborne  radar.  Also  they  do  not  consider  a 
whitening  processing  stage. 

An  example  of  an  adaptive  scheme  is  the  one  introduced  by  Brennan  and  Reed  [14]  which  applied  the 
adaptive  theory  popularized  by  Widrow  [15].  However,  as  in  any  adaptive  scheme,  this  method  must  take 
into  account  the  convergence  rate,  step  size  considerations,  processing  time  vs.  acquisition  time  among  many 
other  critical  conditions.  For  this  reason  modifications  to  this  seminal  paper  appeared  later  in  the  literature. 

In  [16]  a  method  is  developed  using  the  State  Space  approach.  In  this  method,  modeling  and  whitening 
is  used  to  model  ground  clutter  to  consequently  aid  in  target  detection.  Notice  that  all  MAR  models  can  be 
represented  as  state  space  models  from  a  relation  of  the  state  and  the  autoregressive  coefficients,  however 
the  opposite  is  not  necessarily  true  [17],  [18]. 

3  Multichannel  Autoregressive  Process 

In  general,  a  multichannel  autoregressive  process  can  be  described  by  the  following  equation 

p 

X{n)  = -Y,Mk)X{n-k)  +  U{n),  (4) 

fc=l 

where  A{k)  G  are  the  complex  feedback  parameter  matrices  X{k)  e  is  the  data  column  vector,  and 

U{n)  is  a  multichannel  white  input  vector  process  whose  autocorrelation  function  satisfies  Ruu{k)  —  E<5(fc), 
where  E  is  the  variance  of  the  driving  process.  Notice  that  this  type  of  modeling  can  also  be  viewed  as  an 
innovation  process  as  described  in  [19]. 

In  innovation-based  detection  architectures  (IBDA)  ,  the  received  signal  is  converted  to  an  innovations 
process  as  shown  in  Figure  2. 

The  filtering  of  X{n)  produces  an  output  £(n),  which  has  been  whitened  temporally  as, 

p 

£(n)  -  X{n)  +  A{k)X{n  -  k)  (5) 

A:=l 

notice  that  e{n)  may  be  viewed  as  the  prediction  error  of  the  classical  linear  prediction  problem. 

The  idea  behind  IBDA  is  that  the  prediction  filter  coefficients  A{k)  would  contain  information  about 
the  received  signal  (clutter,  jamming,  signal),  and  hence  a  detection  test  could  be  based  upon  the  achieved 
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m 


Figure  2:  Innovations  Process 


parameterization  of  the  ^(A:)s. 

A  solution  for  finding  the  set  of  MAR  parameters  {A{k),l  <  fc  <  p}  is  by  solving  the  multichannel 
Yule-Walker  equations 


R{0)  i?(-l)  • 

R{1)  R{0) 

■  R{~p  + 1)' 

•  Ri-p  +  2) 

A^i2) 

■mr 

R{2) 

R{p-1)  R{p-2)  ■■ 

RiO)  . 

-A'^ip). 

.Rip). 

where  R{k)  ~  E{X*(n)X'^{n  -h  A:)}  and  it  can  be  shown  that  R{~k)  =  R^{k), 
Then  the  parameters  are  found  from  correlations  as  follows 


A'^il)- 

A'^{2) 

1 

R{-1)  ■ 

fi(0)  ■ 

■■  Ri-p  +  i)' 
■■  R{-p  +  2) 

-1 

- /?(!)- 

Ri2) 

A'^{p). 

.Rip-i) 

Rip -2)  ■ 

•  •  RiO)  . 

-Rip). 

(7) 


However,  lacking  a  closed  form  expression  for  the  needed  correlation  matrices  R{k),  estimated  values  R{k) 
are  computed  from  the  received  signal.  To  ensure  positive  definiteness  of  R{k),  typically  the  biased  estimates 
are  used.  The  driving  covariance  matrix  is  found  as  follows 

E  -  E{X*{n)[X{n)- Xin)]^} 

=  RiO)  +  El^X*{n)J2x'^in-k)A^{k)^ 

P 

=  R{0)  +  Y^Ri-k)A^{k) 

k=l 

=  RiO)  +  ^Rf^{k)A^ik)  (8) 

fc=i 


where  X{n)  is  the  forward  prediction  define  as  X{n)  =  -  A{k)X{n  -  k).  Notice  that  the  prediction 
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error  power,  here  called  PEP^  is  equal  to  the  trace  of  S  and  the  off  diagonals  elements  correspond  to  the 
(spatial)  cross-correlation  of  the  driving  noise. 

The  relation  between  the  spectrum  of  the  multichannel  signal  X  (n)  and  the  two  dimensional  spectrum 
is  discussed  in  [20],  a  brief  formulation  of  this  result  follows. 

The  transfer  response  of  a  multichannel  impulse  response  matrix  H{k)  is  given  by 

■Hn{z)  Hi2{z)  •••  Him{z)- 

H2liz)  H22{z)  •••  H2m{z) 

H{z)=  :  ;  ;  (9) 

.Hmi{z)  Hm2{z)  ••• 

where  Hij{z)  =  YlT=o  Hij{k)z~^.  For  a  finite  time,  say  fc  =  0, 1, . . . ,  iV,  the  matrix  H{z)  can  also  be  written 
as 
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The  two  dimensional  transfer  function  is  given  by 

K  K 

g{zi,z2)  = 

A:=0  /=0 

Notice  that  the  elements  ak,i  can  be  arranged  in  matrix  form  as  shown  below 

•  O,0K 

aio  ail  •••  aiK 

aKo  a>Ki  •••  a,KK 
Then  the  two  dimensional  transfer  function  can  be  written  as 

G(zi,Z2)  =  [aoo  +001^:^^  +  . . .  +  aoicz^^] 

H-  Zi^  [aio  +  011^2  ^  +  . . .  “h  aiKZ2  *f  . . . 

+  z:[^[aK0  +  aK\Z2^  +  ...  +  aK2Z2'^] 


(10) 


(11) 


Now  let  ai  =  then 

G{zi,Z2)  =  {[all  alHl 

where  Z2  is  the  middle  matrix  of  (13)  and  Zi  is  the  rightmost  vector  of  (13).  Using  this  result  G(zi,  Z2)  ~  a 
can  be  written  as 

-  Hu{0)  +  --  +Hu(N)z2'^  Hi2(0)  +  ---  +  Hi2iN)z-^  //im{0)  +  •  •  ■  + 1  [  1  ' 

/f2l(0)  +  ---  +  i/2l(Ar)22-^  //22(0)  +  ---  +  i/22(Af)z2"'^  /12m{0)  +  •  •  •  + /?2M 

^0  .  ... 

+  •  •  •  + //Afl(^)Z2  ^  ■^^M2(0)  + - ^  - + 

Therefore 

G(zi,Z2)  =  alH{z2)Zi,  (14) 
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ho 


and  the  power  spectral  density  for  the  MAR  process  is  given  as 


Par{zi,Z2)  = 


(15) 


\H{ZuZ2)\^ 

where  ao  =  [cr  0  0  ...  0]  cr  is  the  2-D  prediction  error  variance,  and  H{z2)  =  • 

In  a  later  section,  the  resulting  MAR  2D  spectrum  will  be  compared  to  the  spectral  estimate  produced  by 
the  taking  the  classical  periodogram  of  the  actual  radar  data. 


4  Multidimensional  Autoregressive  Model  Order 

The  AIC  for  real  multidimensional  autoregressive  processes  was  first  introduced  by  Akaike  in  [21]  and  is  one 
of  the  model  order  indicators  most  cited.  The  AIC  criterion  for  real  MAR  processes  [2]  is  given  by 

AIC{p)  =  Nln{\\t\\)^2J^p  (16) 

where  p  is  the  model  order,  E  is  the  pth  order  estimate  of  the  prediction  error  covariance  matrix,  J  is  the 
number  of  channels,  and  ||  •  ||  indicates  the  determinant  operation.  Notice  that  since  (16)  was  developed  for 
real  data,  the  direct  application  to  the  multichannel  airborne  radar  problem  (with  complex  signals)  is  not 
readily  apparent.  The  following  development  of  the  Aids  a  synthesis  of  the  various  A /C  development  found 
in  the  literature.  It  has  been  included  here  under  a  unifying  notation  for  extension  the  radar  data  case. 

It  is  assumed  that  a  J  x  Dvector  X  can  be  represented  by  the  following  multidimensional  autoregressive 
model: 

p 

X{n)  =  - A{m)X{n  -  m) +  U{n),  (17) 

m=l 

where  A{m)  is  a  J  x  J  matrix  and  U{n)  is  a  random  J  x  1  vector  with  the  following  relations 

E{U{n)}  =  0,  {OgC*^} 

E{U{n)X’^{n~m)}  =  0, 

E{U{n)U{mf}  ^  J(n  -  m)E 

where  E  is  the  expectation  operator,  and  E  is  a  positive  definite  J  x  J  matrix. 

An  important  assumption  in  the  development  of  the  AIC  is  the  Gaussianity  of  the  least  squares  estimate, 
denoted  by  A(m),  of  A(m), 

•  E(X){^y/N[Aij (m)  Aij (m)] }  =  0,  —  = 

•  ^oo{A^[Aiiji(m)  -  A,,j,(m)][Ai,,^2(n)  -  n, ^2). 

where  Eoo  is  the  limiting  expectation.  Now,  consider  the  one-step  prediction  error  when  A{m)  is  applied  to 
another  independent  realization  of  X{n) 


IMO 


D(n)  =  X;  (a(  m)  -*  A{m)^  X{n  —  m)  +  U{n). 

m=l 

The  variance  of  this  error  with  respect  to  an  independent  realization  X  (n)  is  derived  next.  Let 
D{n)D'^{n)  =  (  -  A{m)]X{n  -  m)  +  U{n)\  (^[>l(/)  -  Ail)]Xin  -  1)  +  U{n) 

\m=l  /  \i=l 

P  V  V 

=  ^  LA{m)X{n  -m)J2  -  l)AA^{l)  +  ^  AA{m)X{n  -  7n)t/’’(n) 

m=l  f=l  m=l 

P 

+  U{n)  X'^in  -  1)AA'^{1)  4-  U{n)U'^{n). 

i  =  l 

where  Ai4(m)  =  A{m)  —  A{m).  Taking  the  expectation  w.r.t.  X  results  in  the  following 


p  p 

E{D{n)D‘^{n)}  =  ^  ^  A/l(m)E{X(n  -  m)A:^(n  -  Z)}A^^(0 

m-l  l=l 
P 

H-  AA(m)E{X{n  -  m)U^{n)} 

m=l 

P 

+  ^  E{U{n)X'^{n  -  1)}aA'^(1)  +  E{U{n)U'^{n)} 

p  p 

=  AA{7n)E{X{n  -  m)X'^{n  -  1)}aA^{1). 

m=l  i=:l 

Notice  that  an  element  of  the  product  of  three  matrices  can  represented  as  follows: 

{A{BC))ih  =EjMBC)jh 

”  YLj  Sp  AijBjgCgh 

Letting  A  =  A{m),  B  =  R,  and  C  =  A^{m)  then 

j  J 

{ARA^)ih  =  Ajj  {7n)Ah,g{l)Rj^g 

j=l  9=1 

Therefore  the  h-th  element  of  the  second  term  of  E{D{n)D'^ (n)}  is 

P  P  J  J 

S  S  S  S  ^^ijim)^Ah,g{l)R  XX 

m—l  1—1  j=l  g=l 

Taking  the  E^o  of  the  above  expression  results  in 

Eoo{N[AAij{m)][AAh,g{l)]}Rxx{'m,  j]l,  g) 

m=l  1=1  j=l  g=l 
^  P  P  J  J 

m=l  1=1  j=l  g=l 
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The  above  result  can  be  thought  of  as  taking  the  trace  of  an  Jp  x  Jp  identity  matrix.  Thus  the  above 
operation  is  equal  to  Then,  the  variance  of  the  prediction  error  is  equal  to 

Eoo{£x{Z)(nfT>(n)}}  =  E  +  ^E.  (18) 

The  determinant  of  this  expression  is  equal  to 

||Eoo{£.{T»(n)D(n)^}}||  =  (^l  +  ^yi|E||.  (19) 

This  formula  is  called  the  multiple  final  prediction  error  (MFPE)  .  However,  the  approximation  of  the 
covariance  matrix  must  be  taken  into  account  since  the  true  E  is  often  unavailable.  The  derivation  of  E 
follows  as; 


1  ^  [X{n)  -  X{n))  (X(n)  -  ^(n))^ 

n=l 

1  ^  /  p  p  ^  \ 

XI  “  I  H  ^(m)X(n  -  m)  +  C/(n)  +  ^  A{m)X{n  -  m)  J 

n=l  \m=l  m  =  l  / 

/  P  P  x’r 

-  Ail)Xin  -l)  +  U{n)  +  ^  A{l)X{n  -  1) 

\  l=l  ) 

|l/(n)  -  ^  AA{m)Xin  -  m)\  lu{n)-'^AA{l)X{n-l)\ 
n=l  1.  m=l  J  I.  i=l  J 

I  ^  (  P  P 

Tj  S  U{n)U'^{n)  -  U{n)^^X'^{n  -  1)aA^{1)  -  ^  AA(m)X(n  -  m)U'^(n) 


n=l 

P 


m  =  l 


+  AA(m)X(n  -  m)'^X'^{n  -  1)aA^{1) 


m=l 


(20) 


/=i 


However,  this  simplifies  by  noting  from  the  orthogonality  principle  : 


Thus 


0  =  ^  (A:(n)  -  i'(n))  X^(n  - /) 

n=l 

N  /  p  p  ^  \ 

=  XI  (  “  XZ  A{m)X{Ti  -m)+  A{Tn)X(n  -  m)  J  X'^{n  -  1) 

n=l  \  m=l  m=l  / 

=  Ef  U(n)  -  AA{7n)X{n  -  m)^  X'^{n  -  1). 


n=l 


m=l 


^  Uin)X^in  -  0  =  ^  E  E  AA{m)X{n  -  m)X^{n  -  1). 
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Applying  the  above  equality  to  the  second  and  third  term  of  the  last  line  of  equation  (20)  results  in 
=  AA(jn)  (u{n)X'^{n  -  m))^  -  (u{n)X'^{n  -  i))  AA^(Z) 

m=l 

pTp  i^prp  *1 

=  —  ^  AA(m)  ^  AA(/)A(n  - /)X^(n  -  m)  ^  AA(7n)X(n  -  m)X^(n  — /)  aA^(/) 

m=l  ■/=!  .  /=1  _ni=l 

P  P  P  P 

=  -EE  AA{Tn)X{n  -  m)X'^{n  -  1)aA'^{1)  -  EE  AAim)X{n  -  m)X^(n  -  1)aA^{1) 

m=l  1=1  Tn=l  i=l 

P  P 

-  -2EE  AA(7n)X(n  -  m)X^{n  —  /)AA^(/). 


Therefore 


=  lY^Uin)U^{n) 


^  A(n  —  7n)A^(n  —  /)  aA^(/) 


m=l  i=l 


+  EEA^(m)  ^Y^X{n-  m)X'^{n  -  1)aA{1) 


l=l  771  =  1 

N 


lj2uin)U^in) 

n=l 

-  E^  El  E 


Then  taking  the  infinity  expectation  results  in 


yielding 


l|Sll=(l-f)  "lISpll 


Therefore  putting  (22)  in  (18)  results  in  the  proposed  model  order  which  gives  the  minimum  of  the 
multiple  final  prediction  error 


MFPi;(p)=  (l  +  ^)7l-^)  "||S,||. 
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The  development  of  the  multichannel  AlC'm.  (16)  follows  from  the  above  result  by  finding  the  maximum 
likelihood  of  the  joint  Gaussian  density  function.  It  is  assumed  that  a  set  of  independent  Gaussian  real  vectors 
with  positive  definite  time-invariant  covariance  matrix  E  is  given.  The  joint  Gaussian  density  function  is 


fNj{U)  = 


(27r)5‘^^| 


1  x^N 

e  2  2^k=i 


(24) 


Taking  the  natural  log  of  the  above  function  and  multiplying  it  by  -2  results  in 

N 

L  =  JNln{2iT)  +  Nln\\E\\  +  J2u'^{n)'£-^U{n) 

n=l 

=  J7Vln(27r)  +  .?^ln  ||Ei|  +  Ntr  t^^(n)t/(n)E-' j 

=  JArin(27r)  +  7Vln||E||  +  iVtr(EoE-i)  (25) 

where  Eq  =  jj'^U'^{n)U{n).  Taking  the  partial  derivative  with  respect  to  Ej^  of  the  above  expression 
results  in 


dL 


Ntr  (  -  Nti  (  EoE-^J^E"* 


aE, 


aE, 


Equating  this  expression  with  zero  results  in 

aE  ^ 


tr 


aEv 


Then  E^^  —  (EqE  *E  ^)jj,  which  implies  that  Eq  =  E.  Therefore,  to  maximize  (25),  A^ln||Eo||  needs 
to  be  maximized.  Now  substituting  for  E  the  MFPE  estimate,  (23)  yields 

-1 


NlnllEII  =  iVln 


=  Nln 


1  4- 


II 

N 


1  - 


Ie 

N 


Jp 


+  Nln  1 


N 


-j 


+  Nln||Ep|| 


JTVln  1  +  ^  _  JTVln  1-  +Nln||E 


Jp 


N 


JN^  +  JN^+Nln\\t,\ 


N 


=  2j2p  +  Nln(||S||). 

This  result  is  called  the  AIC  and  it  has  been  shown  that  this  result  and  (23)  predict  the  same  model  order 

[4]- 
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4.1  AIC  limitations 


It  has  been  shown  that  the  AIC  criterion  tends  to  select  too  high  order  models  and  will  overestimate  the 
true  order  of  a  finite  order  autoregressive  model  [22].  Moreover,  Nuttall  presented  in  [5]  an  upper  bound  on 
the  model  order  for  which  the  AIC  may  be  considered  to  be  reliable.  This  upper  bound  is  given  by 

sVn 


m3x{pAic)  < 


J 


(26) 


The  A/C  should  not  be  considered  reliable  for  model  orders  exceeding  mdix{pAic)^  That  is,  while  the  AIC 
may  indeed  be  computed  for  high  model  orders,  it  may  be  an  inaccurate  measure  for  the  model  order.  Nuttall 
found  this  result  by  extending  to  the  multidimensional  case  the  upper  bound  for  the  real- valued  univariate 
case  presented  by  Akaike  [5].  Basically,  this  upper  bound  results  from  taking  the  ratio  of  the  number  of 
scalar  coefficients  and  the  number  of  available  scalar  data  points 

JN  -  ^ 

solving  for  p  in  this  equation  and  calling  it  paic  results  in  (26).  Also  notice  that  this  upper  bound  is  a 
tighter  upper  bound  than  that  suggested  by  Akaike  in  [21],  which  is  given  by  either  ^  or 

For  the  present  airborne  radar  case,  a  typical  return  data  length  is  short  and  complex,  thus  limiting  the 
usefulness  of  the  AIC  when  considering  Nutall  upper  bound.  This  lead  us  to  consider  a  justification  for 
model  order  selection  based  on  the  prediction  error  power.  In  the  next  section  we  first  study  the  relation 
between  equation  (26),  which  depends  on  the  data  length,  and  the  prediction  error  power  which  is  calculated 
for  independent  radar  return  realizations.  These  experimental  simulations  show  that  for  large  model  order 
there  is  a  reduction  in  prediction  error  power  for  the  range  cell  under  study  but  at  the  expense  of  parameter 
variance.  This,  in  turn,  leads  to  large  prediction  error  power  for  independent  range  cells  and  thus  could 
serve  as  an  indicator  for  model  order  selection.  The  simulations  also  verify  the  Nuttall  upper  bound  from 
the  prediction  error  power  view  point. 


5  Computer  Simulations 

In  this  section  computer  simulations  are  presented  for  the  prediction  error  power,  model  order  selection, 
prediction  error  power  for  independent  data  realizations,  and  spectral  analysis  of  modeled  as  well  as  actual 
radar  data  from  the  Multi-Channel  Airborne  Radar  Measurement  (MCARM)  project.  The  simulated  return 
data  were  generated  by  MATLAB  routines  developed  by  Scientific  Studies  Corporation  [23].  The  parameters 
used  for  the  synthesized  radar  data  are  given  in  the  table  below 

5,1  Prediction  Error  Power  for  simulated  data 

Figures  4  5,  6,  and  7  show  the  prediction  error  power  for  simulated  airborne  return  data  sets  of  lengths  64, 
128,  200,  and  550.  These  plots  are  generated  as  described  in  the  flowchart  shown  below.  The  simulations 
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J  =  14 

Number  of  linear  array  elements 

AT  =  64, 128, 200, 550 

Number  of  pulses  in  one  (CPI) 

(f)o  =  30 

Array  main  beam  azimuth  angle  (deg) 

fpRF  = 300 

Pulse  repetition  frequency  (Hz) 

fc  =  450 

Transmit  frequency  (MHz) 

UNIFORM 

Array  pattern 

Hp  =  9 

Platform  altitude  (km) 

Vp  =  25 

Platform  velocity  (m/s) 

rc  =  130 

Range  to  desired  ground  clutter  ring  (km) 

o 

II 

Aircraft  platform  crab  angle  (deg) 

Nc  =  361 

Number  of  ground  clutter  patches  in  the  clutter  ring 

varn  =  1 

Noise  power  in  each  channel 

5Ari?  =  3 

Target  signal-to-noise  ratio  (dB) 

fs 

Spatial  Frequency  {—0,25} 

ft 

Temporal  Frequency  {0.25} 

Nr  =  10 

Number  of  independent  realizations 

Table  1:  Simulation  Parameters 


are  performed  for  MAR  model  orders  from  1  through  10.  Once  the  model  order  is  selected  the  MAR  param¬ 
eters  are  calculated  as  described  in  (7)  where  biased  estimates  of  the  correlation  matrices  are  used  from  the 
received  signal.  The  prediction  error  power  is  calculated  by  taking  the  trace  of  the  driving  noise  covariance 
matrix  as  calculated  in  (8).  One  salient  feature  of  these  prediction  error  power  plots  are  that  the  difference 
between  the  PEP  of  order  1  and  those  of  higher  order  are  within  2.5dB.  This  means  that  any  increase  in  the 
model  order  does  not  contribute  significantly  to  any  additional  prediction  error  reduction.  Also  notice  that 
since  the  prediction  error  power  is  calculated  for  a  given  model  order  the  length  of  these  plots  is  independent 
of  the  length  of  data  under  study. 


Figure  3:  PEP  flowchart 

Prediction  Error  Power  Prediction  Error  Power 


5.2  AIC  for  simulated  data 

Figures  8,  9,  10,  and  11  show  ^l/C  plots  for  simulated  data  of  lengths  64,  128,  200,  and  550.  The  AIC  is 
calculated  as  in  (16),  repeated  here  for  clarity 

AICip)=Nin{\\t\\)  +  2J^p 

In  general,  the  proper  model  order  is  usually  that  which  minimizes  the  AIC.  However,  notice  that  these 
figures  do  not  show  the  expected  concave  behavior  for  the  AIC  curves.  These  AIC  curves  are  thus  better 
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Prediction  Error  Power  Prediction  Error  Power 


explained  in  conjunction  with  the  Nuttall  upper  bound.  Notice  that  for  Figure  8  the  minimum  AIC  is  8, 
while  on  the  contrary  the  Nuttall  upper  bound  for  this  data  set  dictates  that  it  not  exceed  1.7  as  indicated 
in  the  table  below.  This  indicates  that,  the  .4/(7  should  be  used  cautiously  for  such  short  data  sets. 


Data  points 

Nutall  upper  bound 

Model  order  estimate  by  AIC 

64 

1.7 

8 

128 

2.4 

10 

200 

3.0 

2 

550 

5,0 

4 

Table  2:  Nuttall  upper  bound  and  AIC  relation 
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Akaike  Information  Criterion 


Akaike  Information  Criterion 
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Figure  8:  AIC  for  64  data  points 


Model  Order  (p) 

Figure  9;  AIC  for  128  data  points 


Akaike  Information  Criterion 


Akaike  Information  Criterion 
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Figure  10:  AIC  for  200  data  points 


Model  Order  (p) 

Figure  11:  AIC  for  550  data  points 


5.3  PEP  for  Independent  Realizations 

As  discussed  in  the  previous  section  the  AIC  has  some  limitations,  in  particular  the  short  temporal  support 
of  the  radar  return  signal  restricts  the  use  of  the  AIC,  Therefore,  an  alternative  means  of  justifying  the  MAR 
model  order  based  on  the  prediction  error  power  follows. 

Model  order  selection  information  can  be  extracted  from  the  prediction  error  power  (PEP)  of  independent 
data  realizations  as  shown  in  the  flowchart  below.  The  independent  realizations,  for  a  given  data  length,  are 
generated  with  MATLAB  routines  supplied  by  [23].  For  the  case  of  actual  radar  return  data,  independence 
is  related  to  different  range  cells  for  a  particular  data  acquisition. 


Figure  12:  PEP  flowchart  for  independent  data  realizations 

The  simulation  consists  in  calculating  the  MAR  parameter  matrices  for  one  range  cell  using  (7)  with 
biased  estimates  of  the  correlation  matrices  and  using  these  parameters  to  calculate  the  prediction  error 
power  for  independent  data  sets.  Then,  the  prediction  error  power  is  calculated  by  taking  the  trace  of  the 
driving  noise  covariance  matrix  as  calculated  in  (8). 

The  idea  behind  this  simulation  is  to  observe  that  for  low  model  orders,  the  set  of  MAR  parameters 
calculated  for  the  range  cell  under  study  can  be  used  for  independent  range  cells.  This  observation  can  be 
seen  in  Figures  13,  14,  15,  and  16.  These  plots  show  that  indeed  the  average  PEP  (x  -  solid  line)  for  a  set  of 
independent  data  realizations  (d2ished  lines)  varies  dramatically,  away  from  the  PEP  of  the  data  under  study 
(o  -  solid  line),  from  the  /1/C  results  above  the  Nuttall  upper  bound.  This  relates  that  fact  that  the  AIC 
should  not  be  considered  reliable  for  model  orders  exceeding  the  Nuttall  upper  bound.  This  indicates  that 
the  estimated  MAR  parameters  for  one  data  set  applied  to  an  independent  data  set  from  the  same  process 
have  a  relatively  small  variance  for  model  orders  less  than  the  Nuttall  limit.  Also  notice  that  as  the  data 
length  increases  the  averaged  prediction  error  power  decreases  since  the  MAR  parameter  variance  decreases. 


Finally  we  have  that  for  short  data  sets,  prediction  error  power  analysis  in  conjunction  with  the  Nuttall 
upper  bound  provide  a  crude  indicator  for  the  model  order  to  use.  In  the  next  section  a  similar  analysis  and 
r.esult  is  presented  for  actual  return  data  sets. 


Prediction  Error  Power 


Model  Order  (p) 


Figure  13:  Independent  realizations  for  64  data 
points 


Prediction  Error  Power 


Model  Order  (p) 


Figure  14:  Independent  realizations  for  128  data 
points 


Prediction  Error  Power  Prediction  Error  Power 


Model  Order  (p)  Model  Order  (p) 

Figure  15:  Independent  realizations  for  200  data  Figure  16:  Independent  realizations  for  550  data 


points  points 

5.4  MCARM  data 

This  section  presents  simulations  using  Multi-Channel  Airborne  Radar  Measurement  (MCARM)  data.  A 
typical  AIC  curve  for  this  type  of  data  is  shown  in  Figure  17.  Notice  that  this  curve  does  not  show  the  ex¬ 
pected  concave  behavior  since  the  data  has  relatively  short  time  support  (128  time  samples  and  14  channels). 


The  figure  also  shows  the  Nuttall  upper  bound. 

Figures  33  through  47  are  typical  prediction  error  plots  for  different  range  cells.  Notice  that,  as  expected, 
in  all  of  these  cases  the  prediction  error  power  decreases  with  respect  to  the  model  order.  This  does  not 
mean  that  the  prediction  error  power  yields  the  proper  MAR  model  order.  Also  notice  that  in  some  cases 
the  prediction  error  power  decreases  from  about  2dB  (Figure  33)  to  about  lOdB  (Figure  38).  This  might 
indicate  that  using  a  high  model  order  will  lead  to  a  more  accurate  prediction  of  the  data.  However,  as 
indicated  by  the  upper  bound  AIC  analysis,  the  upper  limit  for  the  model  order  order  should  not  exceed 
1.7  for  the  time  support  considered  in  these  simulations.  Moreover,  if  higher  model  orders  are  considered 
then  we  have  an  increase  in  variance  which  for  the  MAR  model  leads  to  undesired  spurious  peaks  in  the 
spectrum.  This  is  reflected  in  the  2D“Spectral  plots  20  through  31  shown  below  for  model  order  2  and  10. 
These  2D-spectral  plots  were  computed  as  described  in  section  3. 

Akaike  Information  Criterion 


Figure  17:  Typical  AIC  for  MCARM  data 
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Figure  18:  2nd  order  MAR  for  acquisition  #465, 
range  cell  200 


-0.5  -0.4  -0.3  -0.2  -0.1  0  0.1  0.2  0.3  0.4  0.5 

Bearing  Space 

Figure  19;  10th  order  MAR  for  acquisition  #465, 
range  cell  200 


Periodogram 


Bearing  Space 


Figure  20:  Periodogram  for  acquisition  #465,  range  cell  200 


Bearing  Space 


Figure  21:  2nd  order  MAR  for  acquisition  #520,  Figure  22:  10th  order  MAR  for  acquisition  #520, 
range  cell  200  range  cell  200 


Periodogram 


Bearing  Space 


Figure  23:  Periodogram  for  acquisition  #520,  range  cell  200 
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2D  Spectrum,  p  =2 


Bearing  Space 
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Figure  24;  2nd  order  MAR  for  acquisition  #575,  Figure  25;  10th  order  MAR  for  acquisition  #575, 
range  cell  200  range  cell  200 


Periodogram 
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Bearing  Space 

Figure  26;  Periodogram  for  acquisition  #575,  range  cell  200 
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2D  Spectrum,  p  =2  2D  Spectrum,  p  =  10 


Figure  27:  2nd  order  MAR  for  acquisition  #628,  Figure  28:  10th  order  MAR  for  acquisition  #628, 
range  cell  200  range  cell  200 


Periodogram 
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Figure  29:  Periodogram  for  acquisition  #628,  range  cell  200 
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Figure  30:  2nd  order  MAR  for  acquisition  #644, 
range  cell  200 


Bearing  Space 

Figure  31:  10th  order  MAR  for  acquisition  #644, 
range  cell  200 


Periodogram 


Bearing  Space 


Figure  32:  Periodogram  for  acquisition  #644,  range  cell  200 


Figure  38:  PEP  (acquisition  #520,  range  cell  350) 


Model  Order  (p) 


Model  Order  (p) 


Figure  39:  PEP  (acquisition  #575,  range  cell  105)  Figure  40:  PEP  (acquisition  #575,  range  cell  200) 
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Model  Order  (p)  Model  Order  (p) 

Figure  42:  PEP  (acquisition  #628,  range  cell  105)  Figure  43:  PEP  (acquisition  #628,  range  cell  200) 
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Figure  45:  PEP  (acquisition  #644,  range  cell  105)  Figure  46:  PEP  (acquisition  #644,  range  cell  200) 
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Model  Order  (p) 


Figure  47:  PEP  (acquisition  #644,  range  cell  350) 
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5.5  MCARM  PEP  for  Independent  Realizations 

The  following  plots  were  generated  as  described  by  Figure  12  in  section  5.3,  where  the  independent  data 
realizations  are  related  to  different  range  cells  for  a  particular  data  acquisition.  Notice  that  for  this  data, 
obtaining  independent  realizations  of  the  process  is  impossible.  However,  if  one  assumes  that  different  range 
cell  data  comes  from  a  similar  process,  then  the  use  of  different  range  cell  data  sets  may  approximate  the 
independent  realization  assumption. 

Recall  that  the  idea  behind  this  experiment  is  to  verify  that  the  average  PEP  for  a  set  of  independent 
data  realizations  varies  substantially  (away  from  the  PEP  of  the  range  cell  under  study)  from  the  ^4/(7  results 
above  the  Nuttall  upper  bound.  This  suggests  that  the  MAR  parameters  for  one  range  cell  applied  to  an 
independent  range  cell  data  set,  from  the  similar  process,  have  a  relatively  small  variance  for  model  orders 
less  than  the  Nuttall  limit.  This  type  of  analysis  provides  a  crude  indicator  for  the  model  order  to  use. 

The  data  under  study  (o  -  solid  line)  is  that  of  range  cell  200  for  the  different  acquisitions  and  the  average 
prediction  error  power  (x  -  solid  line)  was  taken  over  9  independent  (different  range  cells)  realizations  (128 
time  samples  and  14  channels).  In  particular,  the  independent  realizations  (dashed  lines)  are  those  of  range 
cells  120,  140,  160,  180,  220,  240,  260,  280,  and  300.  Notice  that  all  the  plots  show  similar  behavior  as  that 
of  the  simulated  data.  That  is  the  set  of  independent  realizations  varies  dramatically  from  the  average  PEP 
after  a  relatively  small  model  order,  except  for  acquisition  #520  which  does  not  show  this  general  behavior. 
This  might  be  attributed  to  the  terrain  for  which  this  data  was  obtained.  However,  in  general,  we  can  assert 
from  these  simulations  that  a  low  model  order  MAR  process  can  be  used  for  the  MCARM  data  set. 
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6  Conclusion 


This  work  has  focussed  on  the  model  order  selection  problem  for  airborne  radar  data  in  a  innovations  based 
detection  setting.  For  proper  detection  processing,  it  is  imperative  that  the  signal  model,  here  a  multichannel 
autoregressive  process  (MAR),  is  capable  of  capturing  the  statistical  characteristics  of  the  clutter.  To  this 
end,  the  model  order  (p)  must  be  large  enough.  However,  the  choice  of  a  model  order  that  is  too  large  will 
be  detrimental  to  detection  processing  due  to  the  associated  large  parameter  variance.  To  provide  proper 
model  order  selection  we  have  looked  at  two  (somewhat  related)  criteria,  the  Akaike  Information  Criterion 
(AIC)  and  prediction  error  power.  These  criteria  were  investigated  using  both  a  physical  radar  signal  model 
from  SSC,  and  actual  MCARM  radar  measurements. 

The  simulations  using  independent  realization  assumptions  indicate  that  a  MAR  process  with  low  model 
order(p  =  2  or  p  =  3)  might  best  be  used  for  innovations  based  detection.  The  independent  realizations 
simulations  also  verify  the  Nuttall  upper  bound  for  AIC  confidence  from  the  prediction  error  power  view 
point.  Results  based  on  simulated  data  (which  allow  independent  data  realizations)  indicate  that,  while 
large  model  orders  achieve  a  reduction  in  prediction  error  power,  this  comes  at  the  expense  of  excessive 
parameter  variance.  In  turn,  this  leads  to  a  large  prediction  error  power  for  independent  data  sets  serving 
as  an  indication  of  a  too  high  MAR  model  order.  In  general,  for  the  MCARM  data  sets,  we  have  that  the 
average  PEP  for  a  set  of  independent  realizations  varies  substantially  after  a  relatively  small  model  order. 

The  study  of  the  development  of  the  standard  AIC  indicates  that  this  model  estimator  should  be  used 
with  care  for  radar  return  data.  That  is,  the  common  assumptions  in  derivation  of  the  AIC  included:  large 
data  sets,  real  (i.e.  non-complex)  data,  and  Gaussianly  distributed.  These  assumptions  do  not  typically  hold 
for  actual  radar  returns,  which  is  complex  and  has  a  short  temporal  support. 

The  MAR  2D-spectrum  simulations  included  in  this  study  show  that  a  relative  high  model  order  results  in 
spurious  peaks  while  a  low  model  order  (p  =  2)  is  capable  of  describing  the  clutter  (from  a  visual  comparison 
with  the  periodogram)  in  agreement  with  the  PEP  results.  These  results  imply  that  a  low  order  MAR 
process  in  an  innovations  based  detection  architectures  may  be  used  for  target  detection. 
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Abstract 

We  studied  the  mathematical  theory  inspired  by  the  problem  of  multi-source 
direction  finding  using  a  phased  array  antenna.  We  studied  two  approaches 
for  solving  the  problem:  the  orthogonal  polynomial  approach  and  the  wavelet 
approach.  The  first  apporach  provides  very  accurate  predictions,  and  requires 
the  theoretically  minimal  number  of  antenna  elements,  but  is  relatively  un¬ 
stable  under  noise.  The  wavelet  approach  requires  a  large  number  of  an¬ 
tenna  elements,  which  can  be  simulated  by  a  synthetic  aperture  raxlax.  This 
approach  also  provides  accurate  predictions,  and  is  extremely  stable  under 
noise. 
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Multi-source  direction  finding 


H.  N.  Mhaskar 


1  Introduction 

A  phased  array  antenna  consists  of  several  elements,  each  producing  an  (al¬ 
ternating)  electric  current  when  hit  by  electromagnetic  radiation.  The  prob¬ 
lem  of  direction  finding  consists  of  finding  the  direction  of  arrival  of  the 
radiation  from  the  measurements  of  the  currents.  If  the  radiation  is  coming 
from  a  single  source,  and  the  antenna  is  in  a  good  condition,  then  there  are 
relatively  simple  and  well  understood  methods  for  direction  finding.  The 
problem  is  extremely  difficult  if  there  are  more  than  one  sources,  and  one 
needs  to  estimate  the  direction  in  which  each  is  located  relative  to  the  an¬ 
tenna.  Because  of  the  great  interest  in  this  problem,  there  are  many  well 
known  algorithms  to  solve  this  problems;  a  survey  can  be  found,  for  exam¬ 
ple,  in  [3].  However,  there  is  practically  no  prior  work  on  the  mathematical 
theory  governing  this  problem.  The  objective  of  the  grant  is  to  begin  building 
such  a  mathematical  theory,  which  would  also  lead  to  better  algorithms. 


2  Formulation  of  the  problem 

Let  A:  >  0  be  an  integer.  A  (narrow-band)  radiation  incident  on  the  k- 
th  antenna  element  produces  a  current  of  the  form  \exp{i{ku  +  0)),  where 
A  >  0  denotes  the  amplitude  of  the  current,  (j)  is  the  initial  phase,  and  u  is 
related  to  the  direction  of  arrival  9  of  this  radiation  by  the  formula 

u  =  /j  sin  6, 
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where  //  is  a  constant  depending  upon  the  array  parameters  and  the  frequency 
of  the  radiation.  Therefore,  if  there  are  m  signals  impinging  on  the  antenna  at 
angles  Oi,  -  ,9m,  with  amplitudes  Ai,  •  •  • ,  A^,  and  initial  phases  (pi,  -  ■  ■  ,(j)m, 

then  the  resulting  current  in  the  k-th  element  is  given  by 

m  m 

H  Xiexp{i{kue  +  0^))  =  exp(iA:u£),  (2.1) 

e=i  t=i 

where  Uf  =  fjismOi,  £  =  The  problem  of  multi-source  direc¬ 
tion  finding  is  to  find  the  values  oi  ue,  £  =  given  the  vector 

V  :=  (uo,  •  ■  • ,  u/v-i),  where  N  is  the  number  of  antenna  elements.  We  observe 
that  the  amplitudes  ut's  can  be  computed  easily  once  the  u/s  are  known. 

In  practice,  the  observations  v  are  subject  to  two  kinds  of  noise.  A 
realistic  model  is  given  by 

X  :=  (rro,  •  •  • ,  x;v-i)  5i(t)v  -f  S2(t),  (2.2) 

where  t  denotes  the  time  at  which  the  observation  was  taken,  Si{t)  is  a 
complex  valued,  normally  distributed  random  variable  with  mean  1,  and 
S2(t)  =  (s2,o(0)  ■  ■  ■ )  S2,N-i(0)  is  a  vector  of  complex  valued,  normally  dis¬ 
tributed  random  variables  with  mean  0.  The  signal-to-noise  ratio  in  this 
model  at  any  time  depends  upon  both  the  magnitudes  of  the  noises  Si  and  S2, 
as  well  as  the  amplitudes  cj^’s.  In  our  investigations  so  far,  we  have  adopted 
the  simplifying  normalization  that  |si(<)|  =  1.  With  this  normalization,  our 
model  becomes 


m 

Xk  =  ^cx;/exp(z(A:u/-h0f(t)))  +  S2,fc(0)  (2-3) 

i=i 

where  ipt's  are  real  valued,  normally  distributed  random  variables  with  mean 
0,  and  S2(t)  is  as  before. 

In  summary,  the  problem  is  to  determine  the  quantities  u^,  £  =  1,  -  ■  •  ,m, 
given  X  :=  (xq,  •  •  • ,  x^v-i)-  We  explored  two  different  approaches  towards 
this  problem,  which  we  will  call  the  orthogonal  polynomial  approach,  and  the 
wavelet  approach. 
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3  The  orthogonal  polynomial  approach 

In  this  section,  the  Stieltjes  integral  [7]  provides  a  convenient  notation.  Let 
vx;  be  a  complex  measure  that  associates  the  (complex)  mass  Ui  with  each  of 
the  points  zi  :=  exp{iu£),  I  =  1,  •  •  •  ,m.  For  the  time  being,  let  us  ignore 
the  noises  and  consider  the  “pure  model”  (2.1).  Using  the  notation  of  the 
Stieltjes  integral,  we  now  see  the  quantities  to  be  the  “moments”  of  this 
complex  measure: 


=  I  k  =  0,---,N-l. 


P{z)  :=  n(^  -  =z^-Yl  apz^-^.  (3.2) 

f=i  p=i 

Our  notation  implies  that 

j  f{z)P{z)dui{z)  =  0  (3.3) 

for  any  function  /  :  {zx,---,Z(]  — ^  C.  Substituting  the  monomials  z'^  in 
place  of  /,  we  obtain  that 

I  z^P{z)du{z)  =  j  z^+'^dwiz)  -  Eflp/  z'^-P+'^du{z)  =  0;  (3.4) 


'^m+k  —  ^  p+fe)  k  —  0,  •  •  •  ,  iV  772.  (3-5) 

p=l 

It  can  be  shown  that  a  unique  solution  of  the  direction  fininding  problem 
requires  N  >  2m— 2.  For  example,  if  772  =  3  and  ^  =  1, 2, 3  are  any  distinct 
points,  then  any  observations  uo,Ui,U2  can  be  realized  with  a  proper  choice 
of  uJiS.  More  generally,  let  4>  be  any  point  in  the  range  0  <  0  <  ■K/{2m),  or 
TT  —  -n /{2m)  <  (f)  <  TT.  It  can  be  shown  (cf.  [1],  Section  1.3)  that  there  exist 
772  positive  numbers  \e{<p)  and  distinct  points  itf(</>)  G  [0,  tt)  such  that 


'£WWM<t‘))=  /  m<io 
0 _ 1  •'0 
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for  any  cosine  polynomial  R  of  degree  at  most  2m  —  2.  The  point  0  is  one  of 
the  points  ue{(p).  We  now  consider  the  points 


ze{<P)  =  ( 


if  £  =  1,  ■  •  • ,  m, 
if  £  =  m  +  1,  •  •  • ,  2m. 


For  ^  =  m  +  1,  •  •  • ,  2m,  we  write  A^(0)  =  \t-rn{4>)-  Finally,  we  write  u;£(0)  = 
It  is  not  difficult  to  check  that 


Y^u„(4,)R{z,{,t,))  =  / 

<=1 


for  any  algebraic  polynomial  R  of  degree  at  most  4m  —  4.  In  particular, 
choosing  R  to  be  the  monomials  z'^,  k  =  0,  •  ■  •  ,4m  —  4,  we  see  that  any 
points  0  in  the  range  [0, 7r/(2m))  (or  tt  —  it /{2m)  <  0  <  tt)  will  give  rise  to 
the  same  observations  u*,.  A:  =  0,  ■  •  • ,  4m  —  4.  In  particular,  N  =  2(2m)  —  3  is 
not  enough  to  uniquely  determine  each  of  the  2m  points  Zf’s.  Actually,  the 
choices  0  =  0  and  0  =  tt  indicate  that  one  needs  2m  moments  for  a  unique 
determination  in  all  cases. 

With  N  =  2m,  we  may  apply  the  equation  (3.4)  for  k  =  0,  ■  ■  •  ,m  —  1. 
Thus,  the  polynomial  P  is  “orthogonal”  to  all  polynomials  of  degree  at  most 
m  —  1.  This  is  the  reason  for  calling  this  approach  the  orthogonal  polynomial 
approach.  We  observe,  however,  that  the  integral  expression  in  (3.4)  is  not 
a  proper  inner  product.  Hence,  the  theory  of  orthogonal  polynomials  is  not 
directly  applicable  here.  In  particular,  the  matrix  of  the  system  of  linear 
equations  (3.5)  is  not  positive  definite.  Nevertheless,  the  system  is  solvable 
for  almost  all  values  of  uje's. 

This  suggests  the  following  method  for  the  solution  of  the  direction  finding 
problem.  Given  the  moments  Vk,  k  =  0,  •  •  ■ ,  2m  —  1,  we  solve  the  system  of 
equations  (3.5)  for  the  quantities  Cp.  The  roots  of  the  polynomial  (3.2)  then 
give  us  the  values  of  2^’s,  from  which  the  directions  can  be  computed  easily. 
The  advantage  of  this  approach  is  that  there  are  no  limitations  on  how  close 
the  angles  of  arrival  can  be,  except  for  the  limitations  of  the  computer  and 
the  algorithms  to  find  the  roots  of  a  polynomial.  The  disadvantage  is  that 
the  mapping  v  — >  (zi,  •  •  • ,  Zm.)  is  notoriously  unstable,  (cf.  [2]). 

In  the  presence  of  noise,  one  therefore  needs  a  greater  value  of  N  than 
2m  —  1.  The  system  (3.5)  is  then  overdetermined.  We  may  then  take  several 
samples  (looks)  and  obtain  the  Vk's  by  averaging.  The  coefficients  Cp  can  then 
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be  found  by  a  least  square  fit.  Moreover,  the  singular  value  decomposition  of 
the  matrix  provides  an  educated  guess  about  the  number  of  angles  involved. 
In  our  experiments,  this  turned  out  to  be  a  reasonably  stable  algorithm, 
yielding  answers  with  a  high  accuracy,  although  the  number  of  looks  was 
very  high. 


4  The  wavelet  approach 


We  use  the  notation  of  the  Stieltjes  integral  also  to  describe  this  approach. 
Again  igonoring  the  noise  for  the  time  being,  let  uj  now  denote  the  com¬ 
plex  measure  on  [— 7r,7r]  that  associates  the  mass  with  the  point  ui, 
£  =  1,  •  •  •  ,m.  The  observations  Vk  can  then  be  written  as  the  trigonometric 
moments  of  this  complex  measure: 

Vk  =  I  e^'^^duj{9),  k  =  0,---,N  -1. 


The  “farfield  computation”  consists  of  computing  the  Fourier  sum  Y!,k=o 
This  sequence  does  not  converge  as  V  — >  oo.  However,  for  integer  r  >  1,  the 


series 


--ifcg 

2^  j^r+l^ 


(where  Y!  means  that  the  term  fc  =  0  is  excluded)  converges  uniformly  and 
absolutely  to  a  function  with  r  piecewise  continuous  derivatives.  The  points 
ug  now  appear  as  the  points  of  discontinuity  of  the  derivative  of  order  r  of 
this  function. 

Wavelets  are  now  extremely  popular  tools  to  detect  singularities  of  a 
function  and  its  derivatives.  In  our  situation,  the  classical  wavelet  transforms 
cannot  be  computed  efficiently.  Along  with  J.  Prestin,  we  developed  [4], 
[5],  [6]  trigonometric  and  polynomial  frames  that  axe  calculated  using  the 
moments  of  the  target  function,  and  can  detect  singularities  of  all  order 
derivatives.  In  the  present  context,  our  frames  take  the  form 


N-l 

T. 


k-N/2 


Vk 

-j^9k,Ne 


-ike 

J 


(4.1) 
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where 


f  2nk  \ 

=  !>[-nTi) 

for  a  suitably  chosen  function  g.  We  have  proved  in  [6]  that  the  quantity 
is  “large  near”  the  points  Ui,  and  “small  away”  from  these.  In  fact, 
we  have  established  precise  asymptotics  for  in  terms  of  g. 

In  our  experiments,  we  used  the  function  g  defined  on  [0, 27r]  by 

ifa:e[0,7r], 

\  (x  -  7r)^(27r  -  x)^,  if  X  6  [tt,  27r], 

and  extended  to  the  whole  real  line  as  a  27r-periodic  function.  The  value 
of  r  seemed  to  be  irrelevant;  we  used  the  value  r  =  3.  In  contrast  to  the 
orthogonal  poKmomial  approach,  this  method  is  substantially  more  stable 
under  noise;  we  could  detect  the  angles  of  arrival  even  with  a  SNR  of  —9.bdB. 
It  does  require  a  very  high  value  of  N.  We  observe  that  only  N/2  values  of 
Vk  are  used  in  the  calculation  of 

To  illustrate  the  relative  merits  and  demerits  of  the  two  methods,  we 
considered  the  incidence  angles  10°  and  10.01°,  with  no  noise,  and  amplitudes 
1.  With  4  elements,  the  orthogonal  polynomial  method  gave  the  predictions 
10.0093°  and  9.9991°.  The  farfield  computation  with  256  elements  and  the 
frame  transform  ^512  (which  also  uses  256  moments)  are  shown  in  Figure  1. 
(In  these  figures,  the  corresponding  u-  values  are  on  the  x  axis;  the  values  of 
interest  are  0.5455  and  0.5461  respectively.)  It  is  seen  that  neither  of  them 
is  capable  of  this  resolution. 

Again,  we  took  the  angles  10°  and  10.3°,  with  amplitudes  1.  With  an 
additive  noise  of  variance  0.01  and  10,000  looks,  the  orthogonal  polynomial 
method  with  8  elements  gave  the  predictions  of  10.1345°  and  11.6895°  re¬ 
spectively.  The  farfield  computation  and  the  frame  transform  in  Figure  2 
were  computed  with  a  variance  of  3  and  just  one  look.  The  u-values  of  in¬ 
terest  here  are  0.5455  and  0.5617.  Clearly,  the  wavelet  approach  works  much 
better. 

Finally,  we  took  the  incidence  angles  10°  and  15°,  with  an  additive  noise 
of  variance  1  and  amplitudes  1.  With  8  elements  and  10,000  looks,  the 
orthogonal  polynomial  approach  gave  the  predictions  of  10.1950  and  15.1915. 
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5  Conclusions 


We  have  studied  two  approaches  for  the  solution  of  the  multi-source  direction 
finding  problem.  The  orthogonal  polynomial  approax:h  gives  very  accurate 
predictions  with  a  small  number  of  antenna  elements,  with  no  limitations  on 
how  close  the  incidence  angles  can  be.  However,  it  is  very  unstable  under 
additive  noise.  The  predictions  are  still  reliable  if  the  separation  is  near  the 
higher  ranges  of  super-resolution,  but  the  number  of  looks  required  is  very 
large.  In  contrast,  the  wavelet  approach  requires  a  large  number  of  antenna 
elements,  which  can  probably  be  arranged  with  a  synthetic  aperture  radar, 
but  is  very  stable  under  noise,  and  requires  only  one  look.  The  frames  being 
trigonometric  polynomials,  there  are  theoretical  limitations  on  the  resolution 
that  can  be  achieved,  but  frames  using  the  same  amount  of  information  are 
capable  of  higher  resolution  than  the  classical  farfield  computation. 
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Abstract 


The  rapid  and  efficient  development  of  reliable  software,  especially  large  software, 
is  a  task  for  which  humans  are  not  particularly  well  suited.  Still,  humans  possess  a  broad 
expertise  in  programming  and  software  design  that  has  resisted  attempts  to  automate  the 
task.  Important  to  the  present  study  is  the  human  capacity  to  recognize  the  intentions  of 
software  separate  from  the  code  implemented.  Recognition  of  intentions  is  defined  as  the 
ability  to  categorize  software  into  coherent  goal  related  groups,  such  as  financial,  database, 
or  system  software.  The  author  in  the  past  created  a  software  evaluation  and  recognition 
system  from  the  combination  of  two  recent  cognitive  science  techniques  in  recognition  and 
categorizing.  One  technique  recognizes  objects  in  a  vastly  multi-dimensional  space  using 
eigenvectors,  and  the  other  technique  automatically  processes  text  objects  using  n-grams. 
The  investigation  demonstrated  how  applying  these  techniques  can  create  a  coherent  space 
for  recognizing  software  source  code.  The  present  study  joined  the  software  recognition 
system  with  an  evolutionary  computational  system  to  create  an  adaptive  and  bootstrappable 
system  that  shows  promise  for  source  code  categorization.  Such  a  system  could  be  used  in 
a  search  or  browser  system  that  would  immediately  benefit  to  human  programmers  who 
review  legacy  software.  Finally,  the  approach  is  generalizable  to  a  large  domain  of 
evaluation  and  recognition  tasks  (such  as  audio,  image,  logic  and  text)  because  the  methods 
use  an  informational  approach  based  on  atomic  (frequency  based),  not  featural  (object 
based),  representations. 
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AN  EVOLUTIONARY  SYSTEM  FOR  MACHINE  RECOGNITION 
OF  SOFTWARE  SOURCE  CODE 


Ronald  W.  Noel 


Introduction 


Software  engineering  exists  in  the  throes  of  a  cognitive  science  dilemma.  On  one  hand,  the 
engineering  of  software  is  a  very  human  process.  It  requires  broad  expertise  in 
programming,  software  design,  and  the  awareness  of  the  goals  and  intentions  of  the 
software  system.  For  the  most  part,  these  human  processes  are  poorly  understood  by 
cognitive  scientists  and  have  not  succumbed  to  software  automatization  techniques.  The 
inability  to  automate  such  tasks  has  left  software  engineering  to  humans.  But,  the  creation 
of  software,  especially  large  software,  is  not  a  task  for  which  humans  are  well  suited.  The 
need  to  think  in  exact  detail  and  to  consider  a  large  number  of  simultaneously  interacting 
factors  limit  the  scope,  the  size,  and  the  reliability  of  the  software  that  a  human  or  group  of 
humans  can  produce. 

The  most  reasoned  approach  to  attack  this  dilemma  is  the  judicious  division  of  software 
programming  tasks  between  humans  and  computers.  In  such  an  approach,  humans  would 
work  in  broad  terms  suited  to  their  abilities  to  manage  the  goals  and  intentions  of  a 
program.  Humans  could  specify  the  program  in  terms  of  actions  and  purposes.  The 
specifications  of  the  program  could  then  be  turned  into  code  by  using  formal  methods.  One 
would  expect  that  the  creation  of  such  code  would  be  of  a  much  higher  quality  than 
anything  that  humans  could  produce  unaided.  Indeed,  such  automated  coding  from  a 
computer  based  design  management  system  is  the  approach  that  describes  much  of  the 
work  and  success  of  knowledge  based  system  engineering  [8,  9]. 
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Further  success  in  raising  the  quality  of  software  will  rely  on  the  computer  taking  over 
more  and  more  of  the  human  tasks.  The  first  tasks  to  consider  are  those  that  comprise  the 
better  understood  cognitive  tasks,  especially  the  mundane  ones.  Automating  such  tasks 
would  free  the  humans  to  work  on  other  tasks.  The  benefits  to  software  engineering 
would  include  greater  job  satisfaction  for  the  human,  a  reduction  in  development  time,  and 
an  increase  in  software  consistency  (i.e.,  software  quality).  The  level  of  performance 
might  be  lower  than  that  of  the  human  expert  for  the  same  task,  but  the  benefits  could  be 
obtained  as  long  as  the  software's  performance  was  near  the  average  for  human 
performance.  Additionally,  because  automated  software  capabilities  are  independent  of 
humans,  the  expertise  would  not  be  lost  through  the  attrition  of  normal  work  environment. 
Instead,  the  software  could  be  developed  as  a  separate  component  and  studied  and 
improved  towards  the  long  term  goals  in  computer  aided  software  engineering  system. 

This  paper  investigates  a  system  which  will  recognize  the  intentions  of  existing  legacy 
software  for  use  in  the  development  of  the  functional  specifications  of  a  future  software. 
The  system  will  be  a  pattern  recognizer  capable  of  assigning  software  to  coherent  goal 
related  groups  [10].  Applications  for  the  recognizer  might  include  a  filter  in  a  browser,  a 
prefilter  that  prioritizes  software  for  an  automated  recovery  system,  a  tool  that  aides  in  the 
segmentation  of  software  into  meaningful  chunks,  and  a  method  for  determining  the 
intentions  of  a  software  artifact  use  in  automated  processes.  Consider,  when  faced  with  the 
task  of  reviewing  a  vast  amount  of  software,  a  human  would  first  make  a  series  of  quick 
initial  judgments  based  on  holistic  pattern  recognition  to  eliminate  as  many  candidates  for 
use  as  possible,  and  prioritize  any  software  kept.  The  remaining  software  would  have  to 
go  through  some  detailed  analysis  to  determine  if  the  software  was  suitable.  Software  that 
passes  the  quick  judgment  relies  on  human  holistic  pattern  recognition  ability.  Since  these 
are  the  capabilities  that  the  filter  are  to  emulate  a  brief  discussion  of  human  holistic  pattern 
recognition  will  be  valuable. 
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Holistic  Recognition 


A  fundamental  schism  exists  between  human  intellectual  abilities  and  machine  processing 
capacities.  Humans  are  open  systems  which  tend  to  act  in  very  holistic  and  non- 
deterministic  ways.  On  the  other  hand,  machines  are  closed  systems  better  suited  for 
analysis  in  a  limited  domain.  Humans  lack  the  tremendous  numerical  computational  speed; 
yet  they  can  process  information  holistically  in  an  automatic,  rapid,  and  natural  manner. 
Machines  possess  tremendous  computational  capabilities;  yet  no  algorithm  exists  to 
perform  holistic  processes  as  well  as  humans  do.  In  general,  these  differences  have  led  to 
an  antagonistic  interface  between  humans  and  computers,  however,  these  same  differences 
could  lead  to  a  beneficial  synergism.  Ideally,  a  good  interactive  system  would  integrate  the 
best  human  qualities  with  machine  computational  capabilities  enabling  it  to  outperform 
either  of  the  two  cognitive  components  alone  [7], 

The  theory  regarding  holistic  processing  can  be  separated  into  two  stronger  and  weaker 
stances.  Under  the  weaker  stance,  features  (meaningful  units  separate  from  context) 
interact  with  each  other  through  configural  processes  to  form  emergent  properties  or 
"second-order  relational  features".  Under  the  stronger  stance,  the  process  is  completely 
holistic  in  that  its  representation  is  non-decomposable,  and  no  explicit  description  of 
features  exists  outside  the  context.  These  stances  provide  two  ways  to  understand  systems 
to  support  holistic  representation:  The  traditional  approach  to  understanding  holism  is  to 
seek  a  system  of  features  and  relationship  that  combine  in  meaningful  and  unexpected  ways 
to  capture  attributes  of  configurations  [1 1],  A  newer  approach  is  to  seek  a  system  which 
endeavors  to  understanding  the  relationships  between  configurations  without  the 
decomposition  into  features  [12], 
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Cognitive  research  with  humans  tends  to  support  the  use  of  the  newer  approach  to  holism 
when  dealing  with  expertise,  A  well-known  area  in  which  cognitive  researchers  study 
these  holistic  processes  is  the  recognition  of  objects  and,  in  particular,  faces.  Theories 
regarding  the  recognition  of  objects  and  faces  have  been  distinguished  by  different 
perceptual  encoding  and  representational  processes.  Much  of  basic  object  recognition 
theory  has  been  based  on  the  decomposition  of  parts  and  the  analysis  of  edge  features  [12, 
13, 14],  On  the  other  hand,  face  recognition  theory  has  been  based  on  stronger  holistic 
processes  which  utilize  holistic  surface  characteristics  such  as  texture,  color,  and  shading 
[15],  Important  to  the  present  paper  is  that  research  suggests  that  the  distinctions  between 
object  and  face  recognition  begin  to  fade  when  one  examines  expert  object  recognition[  1 1], 
Experts  appear  to  use  utilize  the  stronger  holistic  processes  similar  to  those  found  in  face 
recognition. 

Machine  Categorization  of  Holistic  Representations 

Machine  categorization  of  objects  is  studied  under  many  banners.  Each  banner  represents 
either  a  domain  of  objects  to  be  classified  (i.e.,  images  versus  text),  process  (i.e.  feature 
analytic  versus  template  matching),  representation  (i.e.  neural  networks  versus 
propositional),  or  any  combination  of  the  three.  Considering  such  broad  topics  as  data 
modeling,  object  recognition,  and  text  comprehension  one  finds  that  a  dominant  approach 
is  the  decomposing  of  complex  representations  into  a  small  number  of  components  or 
features  along  with  a  limited  number  of  possible  relationships.  The  features  and  their 
relationships  are  then  captured  in  a  parameterized  model  that  can  easily  be  manipulated  or 
"understood"  by  algorithms  for  computer  use.  For  our  purposes,  a  feature-based 
decomposition  method  for  developing  a  filter  would  be  to  select  a  set  of  key  words  and 
phases  and  a  set  of  rules  that  would  categorize  based  on  the  presence  and  absence  of  the 
words  and  phases  in  a  given  text. 
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However,  problems  occur  with  such  an  approach  since  it  is  open  ended  and  requires  either 
a  human  analyst  to  hand  craft  the  solutions  or  tailor  make  automated  approaches  to  pick 
features  and  discover  rules  for  relationships.  The  difficulty  of  this  is  that  it  is  usually  not 
clear  what,  if  any,  set  of  features  will  decompose  the  representation  coherently. 

Remember,  a  representation  that  cannot  be  decomposed  into  a  limited  set  of  features  and 
relationships  is  said  to  be  holistic.  Such  holistic  representations,  however,  do  succumb  to 
atomic  or  frequency  level  decomposition,  and  usually  it  is  the  atomic  symbols  that  are 
manipulated  on  a  machine.  The  author  suggest  that  software  source  code  which  is  created 
by  humans  is  one  of  these  non-decomposable  representations  when  ii  comes  to  semantic 
classification  or  categorization. 

Human-computer  interaction  commonly  uses  atomic  representation  of  objects,  as  shown  in 
the  following  examples:  (1)  A  word  processor  stores  and  manipulates  the  graphemes  or 
visual  symbols  of  natural  language.  Through  the  manipulation  of  the  graphemes,  the 
program  enables  humans  to  represent  human  thought  in  text.  (2)  Graphics  programs 
create,  manipulate,  and  store  pixel  definition  of  points  of  light.  Though  the  manipulation  of 
the  pixels,  the  program  enables  humans  to  represent  images  and  visual  perceptions.  (3) 
Through  the  manipulation  of  program  primitives,  a  programmer  can  tailor  a  machine 
actions  to  achieve  some  task  or  intention.  In  each  case,  the  machine  interacts  with  the 
human  at  an  atomic  level  of  representation  and  has  no  semantic  or  meaningful 
representation.  The  task  falls  to  the  human  to  organize  the  representations  though  an 
abstract  understanding  of  the  object  and  the  methods  of  structuring  the  atomic  elements  to 
represent  the  objects. 

In  the  programming  example,  the  atomic  level  of  representation  in  the  interaction  limits  the 
aid  the  application  can  offer  the  user.  The  human-computer  interface  could  be  improved  if 
computers  had  a  meaningful  representation  of  programs.  The  machine  could  be  used  for 
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source  code  data  mining  or  browsing  the  internet.  It  could  suggest  subroutines  to 
accomplish  the  tasks  needed  in  a  program,  determine  if  the  program  is  fulfilling  the 
programmer's  intentions,  and  determine  if  the  programmer  intentions  are  in-line  with 
higher  objectives.  It  could  offer  help  by  locating  similar  programs  and  determine  if  the 
program  corresponds  with  the  programmer's  intentions. 

Eigenfaces  in  Face  Space 

Building  a  machine's  capability  to  understanding  objects  created  from  atomic  representation 
requires  machine  methods  that  are  frequency  and  not  feature  based.  One  such  method  is 
contained  in  the  system  by  Turk  and  Pentland  [2]  at  the  Vision  and  Modeling  Group  part  of 
the  Media  Laboratory  at  the  Massachusetts  Institute  of  technology.  This  method  is  used  to 
identify  faces  from  pixel  encoded  pictures.  The  method  was  developed  as  an  extension  of 
earlier  work  by  Sirovich  and  Kirby  [3]  to  efficiently  represent  pictures  effaces  using 
principle-components  analysis.  Turk's  and  Pentland's  system  creates  a  face  space  based 
on  eigenvectors  to  decompose,  store,  and  recognize  face  images.  The  technique  is  a 
straight  forward  use  of  principle  components  analysis  except  for  an  early  step  of  image 
compression  where  the  best  subspace  is  built  based  on  coordinates  around  a  small  set  of 
exemplar  images,  termed  eigenpictures.  The  image  compression  greatly  reduces  the 
calculation  used  in  the  principle  components  analysis.  Without  image  compression,  the 
analysis  would  need  to  calculate  a  covariance  matrix  the  size  of  the  number  of  pixels  per 
picture  squared.  For  instance,  the  number  of  pixels  (N)  in  a  256  x  256  image  would 
require  calculating  a  matrix  of  the  size  (65,536)2.  Determining  the  eigenvectors  and 
eigenvalues  on  such  a  matrix  is  an  intractable  task. 

The  Turk  and  Penland  process  for  face  recognition  using  low  dimension  space  selecting  a 
training  set  (M)  of  pictures  and  create  the  vectors  I]. .  .1^  by  concating  the  pixel  intensity 
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values,  making  sure  that  the  members  of  the  set  are  normalized.  Normalization  is  required 
since  the  analysis  will  be  sensitive  to  any  differences  in  the  data  set.  For  example,  if  half 
the  pictures  have  dark  background  and  half  have  light  background  then  one  would  expect 
background  light  to  be  a  salient  dimension  in  the  resulting  analysis.  Given  that  background 
light  is  extraneous  to  the  task  of  face  recognition,  one  would  want  to  keep  the  background 
light  the  same  for  all  pictures.  A  large  portion  of  Turk  and  Pentland  [1,2]  face  recognition 
systems  is  front-end  processing  of  any  picture  to  remove  the  background,  center  the  face, 
scale  the  face  to  a  set  size,  etc. 

Next,  principal  components  are  determined  by  finding  the  M  eigenvectors  and  the 
eigenvalues  of  the  vectors'  covariance  matrix.  The  size  of  the  matrix  is  N^.  Presently 
performing  principle  component  analysis  of  data  sets  greater  than  one  hundred  is  difficult. 
Because  most  pictures  have  a  data  size  or  pixel  number  greater  than  one  hundred,  one  must 
use  a  technique  for  principle  component  analysis  developed  by  Sirovich  and  Kirby  [3]:  if 
the  number  of  eigenpictures  (M)  used  is  less  than  the  number  of  data  points  (N),  and  one 
subtracts  the  mean  for  the  data,  there  are  only  the  M-1  degrees  of  freedom  for  M-1  non¬ 
zero  eigenvectors.  The  analysis  uses  the  M  eigenpictures  to  create  a  M  dimensional 
subspace  of  the  possible  images.  Encoding  into  the  M  dimensional  subspace  reduces  the 
size  of  the  principle  component  analysis  from  N  to  M,  and  computational  size  of  the 
covariance  matrix  to  the  size  .  Since  M  (the  number  of  pictures  in  the  training  set  is 
usually  around  8  to  40 )  is  usually  much  smaller  than  N  (usually  in  the  tens  of  thousands), 
the  analysis  becomes  efficient. 

Recognition  is  accomplished  by  first  transposing  a  new  face  into  its  eigenface  components 
weights.  This  is  analogous  to  a  Fourier  transform  of  the  spatial  frequencies  contributing  to 
the  image.  Recognition  is  accomplished  by  comparing  a  newly  transformed  image  to  the 
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distance  between  the  image  and  the  exemplars,  or  other  stored  patterns.  The  new  pattern  is 
categorized  as  being  similar  to  the  closest  stored  pattern. 

The  eigenspace  technique  has  been  successful  with  images  but  its  application  to  C  source 
code  is  not  immediately  apparent.  Another  approach  to  categorizing  objects  by  the 
frequency  of  its  constituent  atomic  parts  is  the  n-gram  techniques.  Important  for  our 
purposes  is  that  the  technique  works  directly  on  natural  language  text.  Although  the 
technique  uses  only  the  letters  of  the  alphabet  and  the  space  symbol  it  can  be  directly 
generalized  to  C  source  code  when  one  adds  the  additional  symbols  used  in  the  language  C. 
Next,  we  will  examine  the  A-gram  approach. 

A-grams 

A-grams  encoded  the  atomic  symbols  of  text  into  n-character  sequences.  If  n  equals  one 
then  the  sequences  would  equal  the  single  letters  in  the  text,  for  n  =  2  the  bigrams,  3  the 
trigrams,  etc..  The  frequency  of  the  sequences  are  usually  collected  by  either  sliding  a 
"window"  of  the  size  of  the  n-gram  across  the  text  one  character  at  a  time.  For  example, 
the  word  "example"  would  generate  the  sequences  "exa",  "xam",  "amp",  "mpl",  "pie", 
"le_",  etc.  All  are  sequences  in  a  n  =  3  n-gram.  Using  n-grams  as  a  scoring  technique 
allows  modeling  the  statistical  nature  of  a  text  in  a  manner  that  is  robust  to  typographical 
errors,  garbling,  etc.  Some  of  the  techniques  score  only  the  n-gram  in  a  word  whereas 
others  score  across  inter-word  boundaries.  At  small  values  for  n,  the  n-grams  collect 
mainly  information  about  spelling  and  word  presence,  large  N’s  collect  information  on 
common  word  sequences  and  thus  start  representing  the  grammar  of  languages.  Indeed, 
the  technique  using  large  n-grams  can  be  used  to  generate  random  test  in  the  style  of  the 
author  on  which  the  n-grams  are  collected.  The  approach  is  language  independent  and  can 


13-10 


be  used  with  Japanese  and  Chinese  computer  text  based  on  a  16-bit  character  code  to 
designate  symbols. 

The  approach  was  developed  by  Damashek  [4]  who  used  a  n-gram  system  to  correct 
spelling  and  typing  mistakes  in  text.  Since  then,  a  complete  system  for  browsing 
documents  has  been  developed  by  Pearse  and  Nicholas  [5]  with  improvements  suggested 
by  Crowder  and  Nicholas  [6]  to  reduce  the  amount  of  information  required  for  a  usable 
system.  Systems  differ  in  implementation  but  have  the  following  general  features.  First, 
the  system  collects  the  n-gram  frequencies  using  a  sliding  "window"  for  a  text.  A  n  equal 
to  5  has  been  found  to  be  adequate  for  browsing  and  hypertexting  documents  [5].  The 
frequencies  are  then  normalized  by  dividing  the  n-gram  counts  by  the  total  number  of  n- 
grams  collected.  Then,  the  counts  are  centered  to  an  average  text  by  subtracting  the 
normalized  n-gram  frequencies  from  a  corpus  of  average  texts.  The  obtained  vector  is  the 
document  histogram  that  may  contain  thousands  of  entries.  At  this  juncture  one  can  use 
the  data  in  several  ways. 

The  Pearse  and  Nicholas  system  TELLTALE  [5]  determines  the  similarity  of  one  set  of  text 
to  another  based  on  the  normalized  document  n-gram  vectors  to  calculate  the  strength  of  the 
relationship  between  the  two  representations.  Also  they  index  text  by  a  query  based  on  a 
similarity  score  calculated  on  whether  a  document  or  query  contains  n-grams.  They  also 
offer  a  method  for  disambiguating  queries  by  specifying  the  context  for  a  query.  The 
context  emerges  from  the  intersection  between  two  similarity  scores:  the  similarity  of  the 
query  and  the  document  set,  and  the  similarity  of  the  current  document  and  the  document 
set.  The  disambiguation  of  the  set  requires  the  ad  hoc  setting  of  thresholds  for  the  two 
similarity  j  udgments. 
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Eigencodes in  Codespace 


Both  the  n-gram  and  the  Egenspace  approaches  seek  to  holistically  identify  groups  of 
objects.  Both  approaches  seek  to  capture  the  frequency  of  the  atomic  features  in  a 
representation.  The  difference  between  the  two  come  from  the  different  techniques  used  to 
determine  grouping.  To  date,  eigenfaces  are  formed  on  dimensions  extracted  from  the  data 
using  principle  components  analysis  with  grouping  determined  by  vectorial  differences  in 
the  space,  and  n-grams  determine  grouping  or  proximity  primarily  through  cluster  analysis 
or  proximity  based  on  least  squared  differences  of  features.  Cluster  analysis  methods  are 
used  to  identify  groups  of  objects  in  whatever  dimension  space  the  objects  are  in,  whereas 
factor  analytical  methods  are  used  primarily  to  reduce  the  dimensions  of  the  space  the 
objects  are  in.  Principle  components  analysis  has  the  advantage  of  reducing  a  large 
number  of  variables  to  a  smaller  number  of  variables  (locations  of  axis)  for  further 
analysis.  The  reduction,  if  done  appropriately,  reduces  noise,  gives  storage  economy,  and 
allows  the  identification  of  underlying  variables  or  those  dimensions  on  which  the  C  source 
code  varies. 

The  author  has  chosen  to  create  a  categorization  technique  for  C  source  code  that  use  the 
low  dimensional  eigenspace  technique  with  the  frequency  counts  of  the  n-gram  approach. 
This  should  be  achievable  since  both  techniques  uses  counts  or  intensities  of  atomic 
representations  in  a  vectored  format.  Given  that  the  data  are  comparable,  one  may  apply 
the  eigenface  technique  to  the  n-gram  counts  of  C  source  code.  In  other  words,  to  look  for 
eigencodes  in  a  code  space  and  analyze  the  reasonableness  of  the  space  for  representing 
code.  The  general  approach  is  to  make  the  codespace  require  the  following  steps: 

•  Acquire  a  small  representative  set  of  C  source  code  to  form  the  exemplar  set  upon 
which  to  build  an  eigenspace  for  the  code. 

•  Count  the  n-gram  frequencies  for  each  source  of  code,  and  place  them  in  a  vector. 
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•  Normalize  the  frequencies  by  dividing  by  the  total  number  hits  for  each  vector. 

•  Center  the  vector  around  the  average  vector. 

•  Apply  large  data  set  conversion  if  necessaiy. 

•  Do  a  principle  components  analysis  to  find  the  eigenvalues  and  eigenvectors  to  describe 
the  lower  dimension  subspace  of  the  eigenspace. 

To  test  the  concept  of  building  a  cognitive  filter  for  C  source  code,  the  author  selected  an 
approach  where  the  representation  of  the  code  would  be  captured  in  n-grams.  The 
encoding  of  source  code  into  n-grams  allows  the  code  to  be  expressed  in  atomic 
representations.  The  representation  is  the  set  of  symbols  that  are  used  to  form  C  code  and 
the  conditional  probabilities  of  a  code  being  selected  given  the  proceeding  n-1  symbols. 
Such  a  representation  is  suitable  to  represent  any  C  source  code  as  well  as  any  general  text. 
The  recognition  process  over  the  n-grams  used  the  eigenspace  low  dimensional 
categorization  technique  used  in  the  face  recognition.  This  was  chosen  over  the  clustering 
techniques  because  the  eigenspace  systems  allow  a  coherent  approach  to  categorization  in 
which  the  underlining  space  can  be  understood  in  terms  of  its  structure.  In  particular,  one 
might  want  to  look  for  a  piece  of  C  source  code  for  which  one  does  not  have  a  good 
example,  but  can  describe  by  its  probable  location  in  the  eigenspace  dimensions. 

A  group  of  eight  sof tware  programs  were  selected  to  test  the  ability  of  a  filter  to  derive 
dimensions  for  an  eigenspace  that  would  result  in  meaningful  categorization  of  C  code. 
The  programs  were  selected  based  on  the  criteria  that  the  programs  performed  a  single 
function  and  the  function  of  the  code  was  either  system  utility,  mathematical,  statistical,  or 
logical.  The  users  comments  and  extraneous  lines  were  removed  from  the  code.  The 
programs  were  then  encoded  in  n-grams  with  an  n  of  one  (i.e.,  symbol  frequency).  All 
possible  one-gram  symbols  were  first  collected  and  assigned  a  position  in  a  vector.  Each 
vector  was  normalized  by  calculating  the  n-gram  likelihood  for  each  program  by  dividing 
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each  n-gram  by  the  total  number  of  n-grams  in  the  program.  The  average  n-gram  vector 
was  calculated.  A  principle  components  analysis  was  performed  given  7  factors  or 
dimensions  for  the  code  space.  A  characterization  of  the  dimension  was  achieved  by  the 
author  by  examining  the  scorings  weights  for  the  symbols. 

The  code  space  approach  to  building  a  low  dimensional  filter  proved  to  be  a  promising 
approach  to  categorizing  software  source  code.  The  method  produced  a  coherent, 
understandable  space  in  which  should  be  advantageous  to  automated  software  reclaiming 
techniques.  A  surprising  finding  that  the  programmer's  style  is  a  major  dimension  detracts 
some  from  the  coherency  of  the  space.  However,  that  finding  is  not  all  bad.  The 
knowledge  that  a  piece  of  software  is  programmed  in  a  structured  format  may  well  be 
useful  knowledge  for  any  search  or  post  filter  analysis  of  the  software.  For  example, 
formal  methods  that  decompose  legacy  software  to  understand  its  functions  or  intentions 
might  work  best  with  structured  code.  Humans  might  want  to  limit  searches  to  structured 
code  so  that  any  software  found  will  be  easier  to  comprehend  and  to  verify  as  useful. 

Evolutionary  Systems 

The  present  proposal  is  to  meld  the  above  software  recognition  system  with  an  evolutionary 
computational  system  to  create  an  adaptive  and  bootstrapable  recognition  system.  The 
evolutionary  system  will  be  used  to  select  the  set  of  n-grams  that  best  differentiate  a 
software  space.  The  creation  of  such  an  adaptive  cognitive  system  for  the  evaluation  and 
recognition  of  software  source  code  would  be  an  important  step  towards  developing  the 
critical  evaluation  function  for  an  evolutionary  software  system  that  automatically  generates 
software  code.  Finally,  the  approach  is  generalizable  to  a  large  domain  of  evaluation  and 
recognition  tasks  (such  as  audio,  image,  logic  and  text)  because  the  methods  use  atomic 
representations  which  support  holistic  processes. 
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Development  of  the  system  relies  on  an  evolutionary  computational  system  of  Noel  and 
Acchione-Noel  [7]  shown  to  be  able  evolve  holistic  objects  that  use  atomic  representations. 
So  far  the  system  has  only  been  used  to  evolve  images  from  humans,  but  like  the  eigenface 
system  above  the  process  can  be  made  to  acconunodate  text  or  code  using  n-gram 
representation.  Important  for  the  present  project  is  the  question  of  the  complexity  and  the 
ability  of  the  proposed  system  to  evolve  the  level  of  complexity  to  effectively  recognize 
code.  Since  questions  of  the  evolutionary  process  and  complexity  are  crucial  to  the  project 
a  brief  description  the  evolutionary  system  to  be  used  will  follow. 

Holistic  processes  in  evolution  are  the  processes  of  evaluating  fitness,  selecting  mates,  and 
producing  offspring  that  act  upon  large  numbers  of  simultaneously  interacting  genes.  The 
strength  and  order  of  the  interactions  preclude  the  decomposition  or  pre-definition  of  gene- 
to-feature  mappings  for  analysis.  The  present  paper  uses  a  new  system  that  uses  humans 
as  holistic  judges  for  evolving  images  in  a  representational  space  that  does  not  pre-define 
features  or  gene-to-feature  mappings.  The  system  demonstrates  human-machine  cognition, 
and  gives  a  direct  method  for  studying  and  understanding  the  effects  of  holistic  processes 
in  computation. 

Instead  of  using  a  feature-based  space,  the  system  used  a  pixel  space  that  effects  the 
resolution  of  the  image  space,  but  forces  no  dimensions  upon  the  images  themselves.  The 
space  is  based  upon  atomic  or  molecular  representation,  similar  to  the  notions  of  atomic  or 
molecular  decomposition  by  Fourier  Analysis  or  Wavelets  [  16].  As  championed  by  the 
pointillists,  small  points  of  just  a  few  colors  can  be  used  to  create  the  psychological 
impression  of  any  form  and  any  color.  The  use  of  atomic  representation  is  sub-featural  and 
allows  the  generation  of  features  along  with  their  configuration.  The  representation  is  not 
constrained  at  the  feature  level  and  encodes  a  dog,  a  tree,  or  a  car  as  easily  as  a  face.  For 
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instance,  one  could  create  a  space  of  25-by-25  pixels  with  each  pixel  being  any  of  eight 
colors.  Such  a  small  space  has  the  potential  to  create  an  enormous  number  of  images,  as 

many  as  2^^^.  The  number  of  possible  images  is  so  large  that  there  exists  no  real 
constraints  on  the  variety  of  forms  that  may  be  represented;  rather,  the  model  constrains  the 
resolution  of  the  image.  The  space  cannot  represent  objects  that  require  more  than  12.5 
lines  of  resolution  in  the  vertical  or  horizontal  axis,  but  such  a  constraint  can  be  reduced  by 
increasing  the  number  of  pixels  and  decreasing  the  pixels'  size. 

System  implementation  required  resolving  additional  issues  in  the  method  of  reproduction 
and  mutation  function.  First,  usually,  simple  cross-over  points  are  used  as  the  method  of 
reproduction,  but  such  a  linear  system  is  inappropriate  for  a  multi-dimensional  space. 
Instead,  we  increased  the  number  of  cross-over  points  until  the  reproductive  system 
considered  a  cross-over  point  at  every  allele.  Such  a  system  of  uniform  crossovers  was 
implemented  by  randomly  selecting  between  the  genes  of  the  two  parents  with  equal 
probability.  Uniform  crossovers  are  thought  to  be  deleterious  to  evolutionary  computation 
[17],  but  have  been  found  useful  by  others  [18].  Secondly,  if  one  uses  a  mutation  function 
that  chooses  among  all  possible  genes  with  equal  probability  for  an  allele,  the  mutation 
function  will  eventually  return  the  image  to  a  random  state.  Instead,  we  limited  the 
mutation  function  to  the  gene  values  of  neighboring  pixels,  causing  smaller  changes  and 
greater  adaptability. 

The  resulting  image  evolving  system  consists  of  a  comma  plus  system  (allows  incest)  using 
a  population  of  fifty  images  in  which  10  images  are  selected  by  the  human  for  each 
generation  iteration.  Note:  in  a  comma  plus  system  the  parents  are  available  for  selection 
in  the  next  generation  so  that  each  generation  after  the  first  is  made  of  parents  plus  their 
offspring.  The  genotype  representation  is  an  array  of  alleles  that  has  the  same  size  as  the 
pixel  representation  (25x25  pixels).  Each  allele  is  a  character  that  corresponds  to  one  of  the 
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possible  colors  (or  genes)  for  the  pixel.  Reproduction  creates  the  offspring  genotype  by 
randomly  and  uniformly  selecting  between  the  genes  of  two  randomly  selected  parents  at 
each  allele  site. 

This  image  evolution  provides  a  new  technique  for  integrating  the  best  qualities  of  human 
and  machine  capabilities  to  create  images.  Neither  system  could  produce  theses  images 
alone.  Machines  lack  the  perceptual  and  memory  skills,  and  humans  lack  the  ability  to 
execute  an  image  holistically.  The  results  show  that  current  theories  of  evolutionary 
computation  are  insufficient  to  explain  the  convergence  of  the  images  in  the  absence  of  a 
feature-based  parameterized  space.  A  new  theory  of  evolutionary  computation,  the 
Stochastic  Shift  Hypothesis,  explains  how  the  images  converge  and  provides  a  basis  for 
modeling  atomic-level  convergence  in  a  holistic  system. 

The  technique  of  image  elicitation  allows  humans  to  use  their  perceptual  and  cognitive 
systems  to  organize  visual  noise  into  the  objects  of  their  memories.  This  process  of 
literally  pulling  an  image  out  of  chaos  will  affect  our  understanding  of  intelligent  systems 
and  future  investigations  across  many  disciplines.  Image  elicitation  will  be  useful  in 
studying  machine  intelligence,  as  well  as  in  studying  top-down  processes  in  interactive 
intelligent  systems.  It  provides  a  means  for  humans  to  experience  how  evolutionary 
computation  works  by  directly  immersing  themselves  in  the  process.  And  it  provides 
cognitive  researchers  with  a  means  of  studying  human  recognition. 

The  system  creates  representations  with  a  new  level  of  complexity  over  previous  work  in 
evolutionary  computation.  The  argument  for  the  increases  in  complexity  is  based  on  an 
increase  in  the  cardinality  of  the  relationships,  increases  in  the  number  of  emergent 
properties,  and  an  increase  in  what  Lofgren  calls  interpretation  and  descriptive  processes 
[19].  In  the  representation,  the  potential  for  complexity  is  related  to  the  relationships 
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among  the  features.  In  our  system,  the  features  are  described  at  an  atomic  or  molecular 
level  (in  our  case,  points  of  light).  The  low  level  of  description  allows  for  an  extremely 
large  number  of  relationships  to  form  as  compared  to  methods  that  search  for  relationships 
among  high  level  features,  such  as  a  nose  or  eyes  in  a  face.  Our  system,  which  takes  the 
stronger  theoretical  stance  in  representing  holistic  processes,  does  not  describe  the  high 
level  features;  the  features  themselves  must  emerge.  Having  the  features  emerge  results  in 
a  greater  number  of  emergent  properties,  including  all  of  the  features  and  the  configuration, 
rather  than  the  configuration  only.  Finally,  our  system  has  both  polygeny  and  pleiotropy. 
Because  of  these  relationships  between  genotype  and  phenotype,  the  complexity  of  both 
interpretation  and  description  have  increased.  Lofgren  associated  those  complexities  to 
computational  complexity  and  Kolmogorov  complexity,  respectively. 

Perhaps  the  most  important  increase  in  complexity  for  evolutionaiy  systems  came  from  our 
intentions.  Current  theory  is  built  upon  a  mind  set  of  parametized  modeling,  or  reducing 
complexity  of  search  through  decomposition.  Our  intention  was  to  embrace  as  much 
complexity  as  possible  so  as  to  explore  a  more  complex  (and  natural?)  evaluation  function. 
The  author  believes  that  advances  in  the  generality  and  the  effectiveness  of  evolutionary 
search  will  come  from  increases  in  complexity. 


Project 

The  intent  of  this  project  is  to  create  a  system  that  recognizes  the  intentions  of  the  source 
code  and  programmer’s  comments  for  the  programming  language  C.  The  system  will 
create  an  informational  eigenspaces  that  encodes  the  code  and  comments  of  C  software. 
The  system  will  be  self  organizing  (bootstrapable)  and  evolutionary  because  of  the 
extremely  large  and  computational  complexity  of  the  problem.  Specially,  the  project  seeks 
to  develop  the  concepts  and  methods  necessary  to  accomplish  the  above  vision. 
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In  software  engineering  the  intentions  for  creating  a  piece  of  software  are  separate  from 
understanding  the  actual  algorithm  or  computation  done  by  the  software.  For  instance, 
one  might  have  intended  to  write  a  program  to  calculate  the  interest  on  a  loan  given  the 
amount  of  the  loan  and  the  annual  percentage  rate.  However,  because  of  a  bug,  the 
program  actually  calculates  something  very  different.  In  this  case,  there  is  a  mismatch 
between  intentions  and  the  actual  computation.  Also,  the  intentions  may  be  implemented  in 
a  variety  of  ways  with  some  implementation  being  more  advantageous  then  others.  If  one 
searches  for  code  that  performs  a  computation  in  a  specific  manner,  one  might  miss 
software  that  does  the  same  computations,  but  in  a  different  and  more  advantageous 
manner. 

Consider,  a  searcher  might  look  for  software  that  calculates  the  interest  on  loans,  and  in 
doing  so  finds  software  that  handles  numerical  problems,  such  as  rounding  errors,  in 
actuate  ways  that  the  searcher  never  considered.  The  searcher  might  then  change  their 
intentions  for  the  software  to  match  the  better  software.  Alternatively,  the  searcher  might 
use  the  set  of  software  that  has  the  intentions  to  calculate  interest  to  determine  the  nature  of 
interest  computations.  In  summary,  there  are  many  advantages  to  the  top-down  intentions 
approach,  but  the  approach  must  be  used  in  conjunction  with  other  systems,  human  or  not, 
that  can  determine  whether  the  computations  match  the  intentions. 

To  gamer  the  advantages  of  an  intentional  approach  one  must  find  a  system  capable  of 
recognizing  or  categorizing  intentions.  Until  recently  this  was  not  considered  possible 
since  intentions  have  holism.  Holism  means  something  cannot  be  decomposed  into  a  finite 
set  of  features  and  relations.  Specially,  intentions  are  thought  to  be  context  dependent  and 
therefore  cannot  be  decomposed,  therefore  the  traditional  methods  of  scientific  analysis  are 
not  available.  However,  modem  techniques  in  information  compression  and  evolutionary 
have  developed  representations  and  processes  that  are  capable  of  holistic  recognition. 
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Approach 


The  heart  on  the  current  approach  is  the  use  of  principle  component  analysis  to  reduce  the 
dimensionality  of  C  source  code  into  a  small  coherent,  meaningful  subspace.  The 
subspace,  to  be  meaningful,  must  position  similar  C  source  code  in  proximity  with  each 
other.  Code  proximity  is  the  result  of  positioning  items  in  the  subspace  according  to  their 
score  on  the  dimension  that  make  up  the  subspace.  If  the  dimensions  separate  the  code  into 
meaningful  and  salient  dimensions  then  the  subspace  will  have  a  useful  coherency  that  will 
allow  the  use  of  vector  distance  in  logical  and  fuzzy  searches.  Given  useful  coherency  then 
one  could  use  the  location  in  subspace  to  search,  to  limit  search,  and  to  rank  order  possible 
code  sources.  Noel’s  (1996)  research  demonstrated  that  a  principle  components  analysis 
based  on  n-grams  in  C  source  code  is  capable  of  creating  a  coherent  subspace  for  C  code 
classification.  However,  a  classification  based  solely  on  the  primitives  of  C  has  limited 
usefulness  to  capture  the  intentions  of  the  software. 

A  finding  of  Noel  (1996)  is  that  the  coherency  of  C  source  code  can  be  of  two  types, 
structural  and  semantic.  If  one  created  a  subspace  based  on  just  the  code  or  structural  part 
of  programs  then  coherency  would  be  based  on  algorithms  similarities.  For  instance,  a 
program  that  calculates  simple  interest  might  be  similar  to  a  program  that  calculates  the 
amount  of  light  reflected  from  a  surface  (both  are  a  percent  of  a  number)  but  both  might  be 
different  from  a  program  that  calculates  compound  interests  ( a  program  that  uses  either 
iteration,  or  growth  approximation).  On  the  other  hand,  if  one  created  a  subspace  based  on 
the  text  of  variable  names,  function  names,  and  programmer  comments,  then  programs  that 
use  the  same  words  or  morphemes  (meaningful  parts  that  makeup  words)  would  be  close 
in  that  space.  In  such  a  space,  programs  that  compute  simple  and  compound  interest 
would  be  next  to  each  other  and  programs  that  compute  reflected  light  would  be  at  a 
distance.  The  two  subspaces  could  be  used  simultaneously  by  using  fuzzy  logic  to  conduct 
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searches  or  as  a  way  to  bridge  between  the  structure  and  semantics.  For  example,  one 
might  used  a  text  description  of  what  the  program  is  to  do  to  first  find  matches  in  the 
semantic  space,  then  use  the  matches  to  bridge  into  the  structure,  and  possible  back  again. 

Good  A^-grams 

The  n-gram  technique  is  based  on  super-computing  techniques  that  seek  to  collect  the 
frequencies  of  all  n-grams  in  a  text  up  to  some  window  size  (usually  ai=  5  to  10).  The 
number  of  different  n-grams  grows  exponentially  with  window  and  sample  text  size.  The 
possible  number  is  limited  only  by  number  of  words  and  the  possible  order  of  the  words  in 
the  sample  text.  The  present  technique  forestalls  the  complete  use  of  the  n-gram  technique 
in  that  the  number  of  possible  elements  that  a  principle  components  analysis  can  use  is 
severely  limited.  The  PC  software  used  in  the  present  project  allows  up  to  100  variables 
(n-grams)  in  the  principle  components  analysis.  Note  that  even  for  main  frame  computing, 
principle  component  analysis  software  is  usually  limited  to  the  three  to  four  hundred.  This 
is  because  an  exponential  relationship  exists  between  the  number  of  variables  and 
calculation  required  to  perform  a  principle  components  analysis.  Given  that  the  present 
technique  must  use  an  incomplete  set  of  n-grams,  it  follows  that  a  method  to  determine 
which  n-grams  to  use  in  the  principle  components  analysis  must  be  established.  To  this 
end,  one  needs  to  develop  the  criterion  on  which  to  base  a  selection,  and  a  method  to 
perform  the  selections. 

The  first  notion  upon  which  a  criterion  can  be  developed  is  the  notion  of  the  completeness 
of  a  set  of  n-grams  to  act  as  a  covering  set.  The  manner  in  which  n-grams  are  formed 
leads  to  each  token  for  a  variable  (letter  or  word  depending  on  resolution  of  the  n-grams) 
being  encoded  multiple  times.  In  fact,  given  the  sliding  windowing  used  to  encode  in  n- 
grams  each  element  will  be  encoded  into  n  separate  n-grams.  For  example,  in  a  letter-level 
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gram  analysis  of  n  =  3,  one  would  find  the  letter  “e”  in  the  word  “attachment”  in  the  three 
n-grams;  “hme”,  “men”,  and,  “ent”.  One  could  reduce  the  total  number  of  n-grams  to 
cover  a  text  while  maintaining  an  entiy  for  each  variable.  However,  the  reduction  requires 
that  one  select  the  n-grams  in  the  set  in  some  way  that  best  maintains  the  usefulness  of  the 
remaining  n-grams,  or  a  good  estimate  of  the  important  sequential  structure  or  syntax 
amongst  the  elements.  To  do  this  one  must  define  “goodness”  of  n-grams  for  our  present 
purposes. 

An  important  issue  is  the  relationship  between  structure  (syntax)  and  meaning  (semantics.) 
A-grams  are  a  method  for  capturing  the  sequence  of  symbols.  The  sequence  of  C  source 
code  is  directly  related  to  the  syntax  of  algorithm  and  the  comments.  This  is  because 
intentions  have  both  structure  and  semantics.  Intentions  require  that  persons  have  both  a 
goal,  and  a  general  method  to  accomplish  the  goal.  For  example.  If  a  person  has  the  goal 
to  be  mayor  of  Troy,  New  York,  then  intentions  come  into  play  only  when  the  person 
decides  to  run  for  the  office.  Once  the  person  intends  to  run  for  major,  their  behavior 
becomes  organized  around  the  actions  necessary  to  become  elected.  One  could  determine 
or  infer  their  intentions  by  either  listening  to  their  statement  about  their  intentions  to  run,  or 
watching  their  actions  (running  for  mayor  requires  certain  sequences  of  actions.)  Since, 
intentions  organize  behaviors  then  one  can  infer  intentions  by  the  structure  of  a  person’s 
behavior. 

The  n-grams  that  the  present  analysis  needs  are  those  n-grams  that  best  flag  the  differences 
between  groups  of  source  code.  What  one  looks  for  are  n-grams  that  are  common  for  one 
group  of  source  code,  but  are  rare  for  another.  The  assumption  is  that  the  word  or  word 
part  used  to  convey  intentions  should  be  common  in  text  with  the  intention,  but  rare  in 
others.  This  difference  in  likelihood  of  occurrence  allows  the  information  metrics,  or 
Shannon’s  complexity  measure  as  an  evaluation  function  of  gram  goodness. 


13-22 


Interim  Work 


The  author  used  n-grams  of  length  equal  to  two  as  the  set  of  variables  to  measure  the 
frequencies  of  co-occurrence  of  C  source  code  which  were  used  to  create  the  eigenspace 
used  in  recognition.  While  useful  when  dealing  with  a  limited  set  of  vocabulary  of  C  code, 
the  approach  failed  when  scaled  to  the  larger  problem  of  the  vocabulary  of  programmer 
comments.  A  large  portion  of  the  grant  work  was  focused  on  the  problem  of  creating  a 
system  that  could  find  a  limited  set  of  symbols  upon  which  a  scaleable  system  can  be 
created.  After  many  attempts  to  work  directly  with  n-grams  in  an  evolutionary  approach 
the  research  turned  to  reanalyzing  the  role  of  n-grams.  This  rethinking  and  generalizing  of 
the  approach  led  to  a  more  generalized  and  less  constrained  formation  of  the  gram  which 
the  author  call  Rosettas,  after  the  Rosetta  Stone  used  in  deciphering  Egyptian  Hieroglyphs. 

While  the  problem  of  n-grams  that  can  be  used  is  unlimited,  the  number  of  symbol 
frequencies  used  in  principle  components  analysis  is  extremely  limited  due  to  the 
exponential  growth  in  computation.  The  software  used  in  the  research  runs  on  PC’s  and 
allows  for  only  one  hundred  symbols.  Principle  components  packages  for  larger  computer 
systems  are  still  limited  to  three  hundred  to  five  hundred  symbols.  Obviously,  a  traditional 
n-grams  approach  could  only  be  complete  with  n-grams  of  n  =  1.  In  a  limited  vocabulary 
such  as  C  source  code,  one  can  find  an  adequate  set  of  n-grams  by  using  only  those  n- 
grams  of  n  =1  or  2  that  are  contained  in  the  sample  source  code  and  that  further  reduce  the 
set  by  selecting  the  best  particle  set  of  n-grams  based  on  the  symbol  weightings  during 
multiple  runs  of  the  principle  components  analysis.  Trying  to  scale  the  system  to  large  n’s 
the  author  quickly  came  to  the  conclusion  that  n-grams  are  far  too  many  and  the  diagnostic 
value  of  any  one  gram  was  far  too  limited  to  form  a  viable  approach.  At  this  juncture,  the 
use  of  n-grams  was  reconsidered. 
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Semantic  A^-grams:  Rosettas 


As  stated  before  n-grams  can  be  considered  a  type  of  covering  set  for  text.  The  set  of  n- 
grams,  n  =  1,  that  comprise  the  alphabet  will  cover  all  words,  meaning  that  for  any  word, 
there  is  a  set  of  n-grams  that  will  cover  all  letters  in  the  word.  This  covering  is  complete, 
but  with  limited  and  hard-to-interpret  results.  For  instance,  one  that  might  use  the  letter 
“a”  more  frequently  then  another  text,  but  for  no  discernible  reason.  A  recognition  system 
built  upon  such  differences  would  not  be  robust  when  used  to  categorize  new  text  samples. 
As  the  n  becomes  greater  (particularly  when  n  =  5  or  greater)  the  interpretability  increases, 
but  so  does  the  need  for  a  large  number  of  different  n-grams.  Basically,  for  our  purposes 
A-grams  are  computational  too  large. 

The  problem  revolves  around  trying  to  capture  all  specific  instances  that  lead  to  a  general 
instance.  Idiosyncratic  words,  consistent  misspellings  and  unusual  prose  in  diaries  were 
used  to  connect  Kaczynski  to  the  Unibomber’s  manuscript.  What  is  useful  in  the 
individual  or  specific  case  is  not  useful  in  general  case.  For  instance,  for  most  cases 
dealing  with  meaning  one  would  not  care  if  one  spelled  “color”  or  “colour.”  The 
distinction  between  the  author  being  British  or  American  may  be  of  little  use  in  discovering 
the  intentions  of  the  software  but  would  cause  the  need  for  multiple  n-grams,  and  may 
cause  unwanted  dimensions  in  the  codespace. 

A-grams  cover  too  much.  Much  of  the  written  English  is  redundant  and  of  little  diagnostic 
value.  The  word  “the”  occurs  frequently  in  our  language,  buts  adds  nothing  to  the  meaning 
of  text.  While  the  gram  “the”  may  cover  a  large  portion  of  all  text,  it  is  of  little  use. 

Further,  n-grams  cover  too  little.  The  gram  for  “the”  might  occur  frequently,  but  is  of  little 
use  in  a  generalized  recognition  system.  A-grams  capture  the  sequential  nature  of  text,  and 
therefore  do  not  cover  components  of  the  meaning  that  do  not  occur  sequentially. 
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For  instance,  consider  the  verb  tense  of  prose.  Past  tense  might  be  indicated  by  the 
morpheme  “ed”,  but  also  by  the  various  irregular  verbs  like  “was”,  “had”,  “were”,  etc.  A 
single  n-grams  by  its  sequential  nature  cannot  capture  past  tense  even  though  past  tense  n- 
grams  have  a  high  likelihood  of  co-occurring  in  the  same  text.  If  one  loosens  the  nature  of 
a  n-grams  to  allow  the  encoding  of  patterns  that  co-occur  in  non-sequential  ways,  then  one 
could  create  a  structure  that  could  encode  multiple  sequences  that  relate  to  a  primitive 
semantic  feature  such  as  verb  tense.  This  possible  relaxed  structure  is  the  idea  that  forms 
the  heart  of  a  rosetta.  Specially,  a  rosetta  is  a  sequence  of  letters  (or  symbols)  that 
differentiates  two  sets  of  the  stimuli  (arbitrary  groups  of  C  source  code  files)  by 
proportionally  covering  larger  segments  of  one  set  of  stimuli  than  the  other. 

Consider  a  rosetta  that  could  encode  primitive  text  semantics.  The  rosetta  “was  had  went 
ed  were”  might  be  related  to  the  tense  of  a  text  that  is  written  about  the  past.  The  rosetta 
“good  better  best  ist  most”  might  encode  text  that  is  written  to  praise  something.  And  the 
rosetta  “she  her  mrs  miss  ms  tress”  might  encode  a  text  that  is  written  about  a  female.  As 
can  be  seen  the  parts  that  make  up  a  rosetta  would  not  normally  occur  in  sequence  in  a  text. 
However,  the  parts  of  the  rosetta  could  be  spread  across  a  text  that  contains  the  semantic 
component  encoded  by  the  rosetta. 

The  result  of  moving  the  encoding  of  syntax  to  that  of  primitive  semantic  levels  should 
increase  the  ability  of  the  recognition  system  to  be  sensitive  to  more  complex  semantic 
structures.  The  problem  remains  as  to  how  to  find  and  evaluate  such  rosettas. 


Methodology 

To  use  an  evolutionary  approach  to  compute  a  rosetta  requires  defining  what  a  rosetta  is  in 
terms  useful  for  evolutionary  simulation.  Therefore  it  is  necessary  to  define  a  genotype. 
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phenotype,  fitness  function  and  the  evolutionary  systems  capable  of  evolving  rosettas.  The 
requirements  are  that  the  rosettas  be  large  in  terms  of  length,  and  capable  of  containing 
letters,  morphemes,  and  words  that  exist  across  a  text  that  encode  a  meaningful  component 
of  the  text’s  meaning. 

The  chromosome  of  the  rosetta  was  selected  to  have  a  length  of  30.  Each  gene  can  be 
ascribed  to  any  of  the  26  capital  letters  of  the  alphabet  plus  a  token  to  represent  a  space. 

This  required  the  preprocessing  of  text  to  capitalize  all  lower  case  letters,  and  replace 
numbers  and  punctuation  with  a  space.  The  length  of  30  was  selected  based  on  the 
following  intuitions.  First,  the  length  of  the  rosetta  should  be  long  enough  to  be  contain 
enough  different  words  and  word  parts  so  as  to  be  sensitive  to  the  present  of  it’s  encode 
meaning  in  a  text.  For  instance,  a  rosetta  that  tried  to  encode  feminine  gender  by  just  “mrs” 
would  not  be  sensitive  to  feminine  gender  in  texts  that  do  not  communicate  martial  status. 
Second,  the  length  should  be  longer  than  26  so  that  the  initial  generations  could  contain  all 
possible  letters.  And  third,  a  length  too  great  could  increase  computational  time  without 
necessarily  increasing  the  quality. 

For  each  iteration,  two  groups  of  text  are  randomly  selected  in  the  following  manner.  First 
the  number  of  different  texts  that  makeup  the  first  group  is  pseudo-randomly  drawn  by  the 
computer  with  an  equal  probability  function  of  1  though  N  -1  (N  being  the  total  number  of 
texts,  which  is  8  in  this  study.)  The  number  of  texts  in  the  second  group  is  pseudo- 
randomly  drawn  from  a  equal  probability  function  of  1  through  N  -  (the  first  group  size). 
The  actual  texts  are  then  selected  randomly  to  fill  the  groups.  Note:  the  eight  text  used  are 
from  the  previous  study  (Noel,  1996.)  The  selections  are  made  pseudo-  randomly  without 
replacement.  The  population  size  for  each  generation  was  selected  to  be  250  of  which  one 
fifth  (50)  were  selected  to  be  parents  for  the  next  generation. 
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The  fitness,  or  ability  of  the  rosetta  to  be  useful  is  measured  in  terms  of  information  metrics 
(i.e,  the  ability  to  differentiate  groups  of  code).  The  evaluation  function  is  the  amount  of 
information  difference  that  a  rosetta  produces  between  to  sets  of  C  source  code  text. 
Informational  difference  is  measured  as  the  average  difference  in  information  needed,  the 
number  of  times  the  rosetta  is  applied  to  cover  a  text  group,  or  to  cover  two  groups  of  text. 
Computer  software  was  developed  to  evolve  rosettas  on  PC  computers  in  the  above 
maimer. 

Results 


The  evolutionary  time  required  to  evolve  a  rosetta  as  described  above  is  about  75 
generations.  The  average  wall  clock  time  to  evolve  one  rosette  on  a  thirty-five  megahertz 
486  IBM  PC  with  unoptimized  software  was  about  twelve  hours.  This  speed  of 
computation  required  fifty  days  to  evolve  all  100  rosettas  for  the  first  iteration  of  forming  a 
codespace.  The  computer  time  was  seen  as  a  potential  issue  for  the  systems  so  efforts  to 
reduce  the  time  were  undertaken.  At  present,  the  evolution  of  one  rosetta  is  under  thirty 
minutes.  Also,  the  computation  of  the  rosettas  can  be  done  in  parallel  so  the  time  to  evolve 
all  100  rosettas  would  be  the  same. 

Figure  1  shows  the  Eigenvalues  of  a  Principle  Components  Analysis  with  Varimax  rotation 
of  the  100  rosettas.  The  first  three  factors  form  the  best  factors  upon  which  to  build  the 
codespace.  The  three  factors  account  for  70  percent  of  the  trace  while  being  both  coherent 
and  general  dimensions.  The  factors  relate  to  using  structured  programming  techniques, 
file  and  system  processing.  The  other  factors  simply  separated  one  program  from  the  rest 
of  the  examples. 
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Principle  Components  Analysis 


Factors 


Figure  1.  Eigenvalues  for  vector  extraction  during  principle  components  analysis 


Inspection  of  the  rosettas  reveal  that  the  evolutionary  approach  was  able  to  find  patterns  that 
contain  letter  sequences  related  to  the  intentions  of  the  C  source  code.  For  instance, 
examine  the  rosetta  in  Figure  2  that  loaded  highest  in  determining  if  the  source  code 
involved  file  processes.  The  letter  sequence  “CLO”  would  be  sensitive  to  the  use  of  the 
word  “close”.  The  letter  sequence  word  “FL”  is  used  in  both  “flush”  and  “flag”.  The 
sequence  “NAM”  is  used  in  “name”  and  “filename”.  And,  the  sequence  “FILE”  stands  by 
itself.  Three  points  can  be  made:  (1)  it  would  be  hard  to  write  C  source  code  that  intends 
to  do  file  processes  using  Standard  10  that  did  not  use  the  underlining  words,  and  so  it 
would  be  coded  by  this  rosetta,  (2)  these  sequences  are  unlikely  in  source  code  that  did  not 
do  file  processes,  and  (3)  the  rosetta  appears  to  be  coherent  in  that  it  relates  to  just  file 
source  code. 
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Figure  2.  The  rosetta  that  loads  highest  on  the  factor  encoding  file  processes. 


Conclusions 


The  progress  made  in  achieving  an  evolutionary  system  to  recognize  intentions  is 
promising,  but  limited.  To  summarize,  the  initial  idea  of  using  n-grams  to  encode  C  source 
code  and  programmer  comments  and  then  the  creation  on  a  codespace  (eigenspace)  to 
recognize  intentions  was  a  failure.  The  failure  was  due  to  the  high  computational  demands 
of  large  n-grams,  and  the  insufficiency  of  small  n-grams  to  accomplish  the  tasks.  As  a 
result,  an  alternative  approach  was  tried.  The  approach  used  a  new  concept  called  a  rosetta. 
The  rosetta  is  a  sequence  of  letters  (or  symbols)  that  seeks  to  differentiate  two  sets  of  the 
stimuli  (arbitrary  groups  of  C  source  code  files)  by  proportionally  covering  larger 
segments  of  one  set  of  stimuli  than  the  other.  The  fitness,  or  ability  of  the  rosetta  to  do  this 
is  measured  in  terms  of  the  reduction  of  information  (i.e.  the  increased  ability  to 
differentiate  groups  of  code).  The  usefulness  of  a  rosetta  to  capture  semantics  is  later 
determined  by  it’s  weighting  in  forming  a  factor  in  an  Eigenspace. 

The  findings  are  from  the  development  of  one  set  of  a  hundred  rosetta  and  the  formation  of 
an  eigenspace  from  that  set.  As  such,  findings  are  a  proof  of  concept  of  the  ability  of  an 
evolutionary  system  to  find  rosettas,  and  the  ability  of  some  of  those  rosettas  to  encode  the 
stimuli  in  a  meaningful  way.  What  is  missing  is  a  demonstration  that  such  a  system  can  act 
in  unsupervised  ways  and  refine  its  abilities  to  resolve  intentions. 
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Subsequent  activities  by  the  author  have  moved  towards  the  creation  of  such  a  system. 
These  activities  included  building  a  software  spider  to  search  the  Internet  to  find  C  source 
code.  A  senior  Computer  Science  student,  Michael  Corbett,  was  funded  for  one  half  year 
by  the  author  with  further  funding  by  Anderson  Consulting  Company  to  examine  the  web 
spider  possibilities.  The  student  developed  a  shareware  spider  that  he  modified  to  look  for 
C  source  code  on  the  web.  His  findings  where  not  promising  in  that  web  sites  do  not 
normally  contain  source  code,  and  that  seeking  such  code  outside  the  immediate  web  site 
might  involve  ethical  considerations.  Additional  funding  for  the  project  was  obtained  by 
including  the  project  in  the  formation  of  an  applied  cognitive  science  laboratory  at 
Rensselaer  Polytechnic  Institute  called  the  Minds  and  Machines  laboratory.  Continuance  of 
this  project  is  funded  by  the  institute’s  funding  for  the  laboratory.  Also,  further 
programming  and  work  is  being  through  graduate  level  class  projects. 
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Abstract 

Field  programmable  gate  arrays  (FPGA)  are  powerful  new  technology  which  can  be  used  to 
maximum  advantage  in  military  software  radio  applications.  The  objective  of  this  research  is  to 
examine  the  potential  role  of  the  FPGA  in  the  implementation  of  high  performance  military  radio 
algorithms.  A  radio  transceiver  implemented  using  state-of-the-art  DSP  technology  -  often 
referred  to  as  software  radio  -  requires  real  time  signal  processing  at  a  variety  of  bandwidths.  In 
order  to  accommodate  the  needed  bandwidths  in  a  discrete  time  implementation,  it  is  appropriate 
to  use  devices  which  are  well  suited  to  each  stage  of  the  system  -  fast,  yet  algorithmically  simple 
devices  for  the  wide  bandwidth  stages  and  slower,  yet  more  flexible  devices  for  the  processing 
required  at  low  bandwidths.  This  report  briefly  discusses  the  processing  requirements  of 
software  radio,  and  assesses  the  role  of  the  current  generation  FPGA  technology  in  implementing 
the  algorithms  required  to  make  these  radio  systems  function  efficiently.  A  case  study  is 
provided  of  a  digital  filter  design  using  FPGAs. 
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RAPID  PROTOTYPING  OF  SOFTWARE  RADIO  SYSTEMS  USING 
FIELD  PROGRAMMABLE  GATE  ARRAYS 


Glenn  E.  Prescott 


1.  Introduction 

Until  recently  radio  transmitters  and  receivers  were  almost  exclusively  implemented  with  analog 
electronic  components.  However,  a  new  approach  is  now  becoming  popular  -  one  that  employs 
digital  electronics  to  implement  most  of  the  analog  signal  processing  functions  in  the  radio.  This 
evolution  in  radio  system  design  is  driven  by  the  ever  increasing  speed  and  decreasing  cost  of 
microprocessors  and  high  performance  analog-to-digital  (ADC)  and  digital-to-analog  (DAC) 
converters.  It  is  no  longer  uncommon  to  sample  a  received  signal  at  the  intermediate  frequency 
(IF)  stage  and  process  the  signal  with  numerical  algorithms  using  a  specialized  digital  signal 
processing  (DSP)  hardware.  The  DSP  hardware  performs  a  variety  of  operations  on  the  signal 
including  down  conversion,  demodulation,  and  filtering;  all  of  which  are  inherently  continuous¬ 
time  (i.e.,  analog)  processes. 

1.1  Digital  Signal  Processing:  Capabilities  and  Requirements 

The  mathematics  of  digital  signal  processing  provides  the  framework  for  the  design  of  software 
radio  algorithms,  while  modem  high  speed  digital  electronic  components  make  real  time 
implementation  of  these  algorithms  possible.  However,  the  hardware  currently  available  to 
implement  DSP  algorithms  for  all  stages  of  the  radio  system  is  still  limited  in  speed,  accmacy 
and  flexibility.  Initially,  digital  signal  processing  was  used  only  for  baseband  waveform 
processing.  As  digital  electronic  devices  increased  in  speed,  DSP  was  soon  applied  to  signal 
processing  functions  performed  at  higher  frequencies  -  e.g.,  the  final  IF  stage  in  a  radio  receiver. 
Fimctions  such  as  IF  bandpass  filtering,  automatic  gain  control  (AGC),  and  coherent  modulation 
and  demodulation  are  t3q)ically  required  at  this  stage.  In  the  absence  of  a  sufficiently  high  speed 
processing  capability,  innovative  techniques  such  as  sub-sampling  are  used  to  process  bandpass 
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signals  of  small  to  moderate  bandwidth.  This  has  allowed  the  boundary  between  analog  and 
digital  processing  to  be  pushed  as  far  up  the  signal  path  towards  the  antenna  as  permitted  by 
physical  electronic  devices.  For  most  types  of  moderate  data  rate  communications  -  on  the  order 
of  100  kB/s  or  less  -  bandwidth  is  not  a  serious  barrier  to  DSP  techniques.  However,  military 
radio  systems  pose  a  notable  challenge  because  of  the  wide  bandwidth  characteristics  of  spread 
spectrum  modulation. 

1.2  Military  Radio  Signal  Processing  Requirements 

Military  communication  systems  often  require  the  use  of  spread  spectrum  techniques  to  provide 
an  antijam  (AJ)  capability;  or  some  measure  of  covertness  through  the  use  of  low  probability  of 
intercept  (LPI)  waveforms.  The  result  is  that  extremely  wide  bandwidth  signals  are  present  at  the 
output  stage  of  the  transmitter  and  the  input  stages  of  the  receiver.  We  know  from  the  Nyquist 
theorem  and  fundamental  bandpass  sampling  techniques  that  bandpass  signals  can  be  sampled  at 
a  rate  no  less  than  the  bandwidth  of  the  signal;  so  high  frequencies  alone  do  not  put  a  limitation 
on  DSP  processor  capability.  However,  wide  bandwidth  signals  are  a  challenge  for  any  type  of 
digital  signal  processing  hardware,  and  they  are  especially  troublesome  for  conventional  DSP 
microprocessors.  While  conventional  DSP  microprocessors  are  optimized  for  real-time  data 
processing,  they  are  nevertheless  implemented  using  the  traditional  von  Neiunaim  architecture  - 
an  inherently  serial  architecture  which  uses  a  single  multiplier  and  executes  one  instruction  at  a 
time.  While  providing  the  advantage  of  flexibility  through  programmability,  this  architecture 
limits  the  speed  with  which  signal  samples  can  be  processed.  Even  modem  DSP 
microprocessors  operating  at  40  million  instmctions  per  second  (MIPS)  have  a  useful  bandwidth 
Umit  of  less  than  500  kHz.  This  is  especially  troublesome  for  military  communication  systems 
which  employ  AJ  and  LPI  waveforms  having  typical  bandwidths  in  excess  of  10  MHz. 

1.3  Advantages  of  Specialized  Digital  Hardware 

When  digital  signal  processing  at  wide  bandwidths  is  required  the  radio  designer  turns  to 
specialized  hardware  which  can  operate  at  much  higher  throughputs  than  is  possible  with  a  DSP 
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microprocessor.  These  include  application  specific  standard  products  (ASSP),  application 
specific  integrated  circuits  (ASIC),  and  field  programmable  gate  arrays  (FPGA). 

Application  Specific  Standard  Products  (ASSP)  such  as  FIR  filters,  correlators,  and  FFT 
processors,  permit  certain  popular  DSP  algorithms  or  fimctions  to  be  optimized  in  hardware  at 
the  cost  of  flexibility.  Use  of  ASSPs  can  significantly  increase  the  device  count  and  often 
presents  special  interface  problems  which  can  lead  to  fiuther  complications.  Furthermore,  due  to 
a  narrow  range  of  applicability,  many  ASSPs  may  not  be  available  in  state  of  the  art  process 
technology  [1]. 

When  performance  is  a  factor  and  product  volume  is  high,  many  designers  turn  to  ASIC 
technology.  ASIC  technology  offers  the  ability  to  design  a  custom  architecture  that  is  optimized 
for  a  particular  application.  For  example  a  conventional  DSP  microprocessor  has  only  a  single 
multiply-accumulate  (MAC)  stage  (see  Section  3),  so  each  filter  tap  must  be  executed 
sequentially.  An  ASIC  implementation  of  a  DSP  algorithm,  on  the  other  hand,  might  have 
multiple  parallel  multiply-accumulate  (MAC)  stages.  When  comparing  the  performance  of  the 
ASIC  versus  the  DSP  microprocessor  it  becomes  apparent  that  the  DSP  microprocessor  offers 
slow  speed  but  maximum  flexibility  (due  to  programmability)  while  the  ASIC  provides  high 
speed  with  minimal  flexibility.  Between  these  two  extremes  lies  the  field  programmable  gate 
array  [2]. 

1.4  Software  Radio 

The  essential  concept  of  software  radio  is  that  most  of  the  analog  signal  processing  operations  of 
the  radio  transmitter  and  receiver  are  implemented  with  digital  hardware  using  DSP  techniques. 
The  placement  of  the  receiver  analog  to  digital  converter  (ADC)  and  the  transmitter  digital  to 
analog  converter  (DAC)  as  close  to  the  antenna  as  possible  are  distinguishing  characteristics  of 
the  software  radio.  In  the  software  radio  receiver,  the  approach  often  used  is  to  digitize  an  entire 
band  and  to  perform  IF  processing,  baseband,  bitstream  and  other  functions  completely  in 
software  [5].  This  approach  requires  the  use  of  high  speed  analog  to  digital  converters  and  high 
speed  DSP  microprocessors.  However,  the  signal  processing  requirements  for  military  and 
commercial  radio  systems  employing  high  data  rate  signals  or  spread  spectrum  modulation  easily 
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exceeds  the  processing  speeds  currently  available  in  off-the-shelf  DSP  microprocessors.  In  this 
case,  special  purpose  DSP  hardware,  application  specific  devices  and  field  programmable  gate 
arrays  can  play  an  important  role. 

The  motivation  for  implementing  radios  in  software  is  that  a  highly  flexible  and 
reconfigurable  communication  system  can  be  implemented  for  relatively  low  cost.  The  ability  to 
adapt  the  radio  to  its  environment  by  changing  filters,  changing  modulation  schemes,  switching 
channels,  using  different  protocols  and  dynamically  assigning  channels  and  capacity  are  features 
which  are  impractical  to  deliver  with  hardware  alone.  Since  the  behavior  of  the  software  radio 
can  be  changed  so  easily,  defining  a  particular  architecture  does  not  limit  the  radio  to  one  specific 
function.  Instead,  multiple  radio  systems  can  share  a  common  fi-ont-end  analog  radio  tuner  while 
having  independent  digital  processing  for  each  individual  radio  channel.  [5] 


2.  Discussion  of  the  Problem:  Field  Programmable  Gate  Arrays 

Modem  field  programmable  gate  arrays  can  implement  functions  beyond  the  capabilities  of 
today's  DSP  microprocessors.  In  fact,  they  have  the  potential  to  provide  performance  increases 
of  an  order  of  magnitude  or  better  over  traditional  DSP  microprocessors,  but  with  the  same 
flexibility  [3].  These  devices  can  provide  the  programmability  of  software,  the  high  speed  of 
hardware  and  can  be  reconfigured  in-circuit  with  no  physical  change  to  the  hardware.  In  fact, 
FPGAs  are  really  "soft"  hardware,  in  that  they  are  a  good  compromise  between  flexible  all- 
software  approaches  which  unfortunately  limit  throughput,  and  custom  hardware 
implementations,  which  are  more  expensive  and  inflexible  [4].  FPGAs  offer  a  powerful 
approach  -  an  architecture  tailored  to  the  specific  application.  Because  the  logic  in  an  FPGA  is 
flexible  and  amorphous,  a  DSP  function  can  be  mapped  directly  to  the  resources  available  on  the 
device.  Modem  FPGAs  have  sufficient  capacity  to  fit  multiple  MACs  or  algorithms  into  a  single 
device  along  with  the  interface  circuitry  required  by  the  application  -  a  single  chip  solution. 

Programmable  hardware  has  been  available  for  many  years  -  conventional  memory 
devices  are  the  most  obvious  example.  Various  PLDs  (programmable  logic  devices)  have  long 
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been  used  in  implementing  state  machines  and  "glue"  logic,  among  other  things.  However,  the 
available  devices  have  tended  to  have  restricted  architectures  and  to  be  rather  small  [7].  The  last 
decade  has  seen  a  significant  change  with  the  introduction  of  a  variety  of  field  programmable 
gate  arrays,  as  well  as  an  evolution  of  some  PLDs  into  much  larger  devices  with  extended 
architectures.  Essentially,  the  FPGA  is  a  general  purpose  programmable  logic  device  consisting 
of  a  regular  array  of  cells  with  distributed  routing  that  can  be  configured  with  a  specific  design 
by  the  user,  without  the  need  to  fabricate  an  application  specific  device  (i.e.,  an  ASIC)  [8], 

2.1  Programmable  Logic  Technology 

There  are  a  variety  of  FPGA  architectures  available  depending  upon  the  manufacturer.  However, 
there  is  one  broad  distinction  that  can  be  made  regarding  FPGA  structure:  the  architectures  are 
either  course-grained  or  fine-grained  [7].  The  earlier  devices  were  simple  arrays  of  logic  gates 
which  were  programmable  in  the  field  in  much  the  same  way  as  a  conventional  ROM.  These 
devices  are  considered  fine-grained  in  the  sense  that  there  can  be  a  large  number  of  very  simple 
logic  operations  which  can  be  interconnected.  On  the  other  hand,  modem  FPGAs  have  a 
relatively  smaller  number  of  more  complex  logic  cells  available. 

Other  than  granularity,  FPGAs  are  differentiated  by  their  chip  level  architecture  and  their 
interchip  wiring  organization.  As  an  example,  the  Xilinx  3000  family  FPGAs  consist  of  an  array 
of  cells  called  CLBs  (configurable  logic  blocks).  Each  CLB  contains  two  latches  and  a  fimction 
generator  as  illustrated  in  Figure  1.  The  internal  connections  within  the  cell  and  the  lookup  table 
in  the  fimction  generators  are  determined  by  configuration  bits  held  in  an  integrated  SRAM. 
This  allows  an  individual  cell  to  implement  quite  complex  combinational  and  sequential 
functions.  The  routing  resources  allow  the  cells  to  be  connected  as  required,  at  least  in  principle. 
In  practice,  the  problem  of  routing  a  congested  design  is  the  major  obstacle  in  obtaining  highest 
performance. 

FPGAs  are  just  begiiming  to  have  a  significant  impact,  although  their  cost  is  still 
relatively  high  (i.e.,  himdreds  of  dollars  for  the  largest  devices).  Two  application  areas  which 
traditionally  have  dominated  their  use  are  general  purpose  gate-level  logic  support  (i.e.,  glue 
logic)  and  emulation  of  new  IC  designs.  However,  FPGA  manufacturers  believe  that  their 
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products  will  change  the  way  in  which  digital  design  is  approached  in  a  revolution  similar  to  that 
engendered  by  the  microprocessor  [7].  The  fact  that  FPGAs  are  now  being  investigated  for  use 
in  high  speed  DSP  applications  is  an  indication  of  the  broad  impact  they  may  have  in  digital 
applications  of  all  kinds. 


Figure  1  -  Configurable  Logic  Block  of  the  X3000  FPGA 
2.2  Practical  Consideration  in  the  Use  of  FPGAs 

Because  the  FPGA  is  programmable  in  manner  similar  to  a  microprocessor,  it  is  already 
becoming  widely  used.  However,  the  configuring  of  hardware  to  fit  a  specific  computation  is 
significantly  different  fi'om  the  programming  of  a  microprocessor.  In  particular,  the 
microprocessor  has  a  fixed  instruction  set,  and  all  solutions  are  algorithmic  in  nature.  In  contrast, 
an  FPGAs  internal  structure  must  be  customized  to  implement  a  particular  algorithm.  Since 
digital  hardware  designs  are  not  software  driven,  the  overhead  associated  with  command 
interpretation,  scheduling  and  execution  is  eliminated  and  there  is  a  substantial  gain  in  speed. 
Furthermore,  a  hardware  design  can  take  advantage  of  parallel  unplementations  to  eliminate 
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bottlenecks  [2].  It  is  interesting  to  note  that  we  may  even  combine  the  two  approaches  and 
compile  a  specialized  microprocessor  into  the  FPGA  with  a  restricted  instruction  set  chosen  to 
suite  any  particular  application. 

It  often  occurs  that  a  computation  is  better  suited  for  either  dedicated  hardware  or 
microprocessor  software.  This  is  the  situation  we  are  examining  in  the  software  radio  -  when  to 
use  FPGAs  and  when  to  use  DSP  microprocessors.  Simply  stated,  an  FPGA  is  appropriate  when 
the  design  calls  for  the  performance  of  an  ASIC  and  the  flexibility  of  a  microprocessor.  An 
FPGA  should  not  be  used  if  the  algorithms  to  be  implemented  are  complex,  or  vary  significantly 
in  structure  or  complexity.  Determining  when  to  offload  DSP  algorithms  to  FPGAs  requires  an 
analysis  of  speed  versus  problem  size.  At  one  end  of  the  scale,  problem  size  gets  very  large  and 
direct  hardware  solutions  become  too  difficult  and  expensive  to  build  [2]. 

The  advantage  of  FPGAs  is  that  they  represent  a  compact  integrated  programmable 
hardware  solution  which  can  be  user  configured  for  any  conceivable  logic  design.  Current 
designs  contain  in  excess  of  40,000  logic  gates,  all  under  the  control  of  the  designer.  On  the 
other  hand,  FPGAs  have  some  notable  disadvantages.  First,  there  internal  routing  contributes 
substantial  delay  between  logic  elements  resulting  in  a  significant  limitation  in  performance, 
although  parallelism  and  pipelining  can  still  be  used.  The  second  disadvantage  is  that  it  is  not 
possible  to  execute  a  variety  of  arithmetic  operations  within  the  logic  resources  available.  Added 
to  this  is  that  the  programming  of  FPGAs  is  difficult,  especially  when  implementing  DSP 
flmctions  [9]. 

2.3  Using  FPGAs  for  DSP  Applications 

The  FPGA  has  recently  generated  interest  for  use  in  DSP  systems  because  of  its  potential  to 
implement  an  infinite  variety  of  custom  hardware  solutions  while  still  maintaining  the  flexibility 
of  a  conventional  programmable  device  [6].  Although  DSP  microprocessors  have  complete 
algorithm  flexibility,  their  performance  is  limited  because  algorithms  are  implemented  by 
sequential  MAC  operations,  as  previously  described.  Additionally,  DSP  microprocessors  have 
an  overhead  for  reading  in  the  operands  and  writing  the  result  through  a  single  data  port. 
Therefore,  a  DSP  microprocessor  may  require  at  least  four  cycles  (i.e.,  read,  multiply,  add  and 
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write)  to  perform  the  simplest  of  algorithms,  resulting  in  10  MIPS  performance  from  a  40  MIPS 
processor  [1]. 

Because  DSP  algorithms  are  optimally  mapped  to  the  device  architecture,  FPGA 
performance  can  significantly  exceed  DSP  processor  performance.  For  example,  a  DSP 
microprocessor  can  implement  an  8-tap  FIR  filter  at  5  Msps.  An  FPGA  can  implement  the  same 
FIR  filter  at  100  Msps  [1].  FPGAs  will  never  completely  replace  general  purpose  DSP 
processors,  however.  Current  generation  programmable  logic  addresses  only  the  fixed  point  DSP 
portion  of  the  market.  General  purpose  DSPs  still  dominate  in  floating  point  performance.  Also, 
general  purpose  DSP  processors  utilize  familiar  software  methods,  while  using  programmable 
logic  requires  a  completely  different  approach  on  the  part  of  the  DSP  designer.  Implementing 
DSP  fimctions  in  FPGAs  provide  the  following  advantages  over  conventional  DSP  hardware: 

a.  Parallelism  -  Using  FPGAs  can  lead  to  significantly  higher  performance  than  a  typical 
DSP  processor  for  some  applications. 

b.  Efficiency  -  An  FPGA  can  be  optimized  for  specific  algorithms,  thus  achieving  the 
performance  of  hardware  with  the  flexibility  of  software. 

c.  In-circuit  Reconfigurability  -  Permits  the  algorithm  or  fimction  to  be  changed  while 
operating  in-circuit.  An  additional  benefit  of  FPGAs  over  ASICs  is  that  they  can  be 
reprogrammed  on  the  fly  in  the  system.  Consequently,  a  single  FPGA  can  implement  different 
DSP  functions  at  various  times  in  a  system  to  boost  overall  performance. 

d. .  Adaptability  -  A  device  that  can  implement  large  internal  RAM  blocks  can  be  used  to 
implement  real-time  adaptive  functions  at  a  throughput  that  cannot  be  matched  by  conventional 
DSP  solutions. 
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2.4  Alternative  Arithmetic  Options  for  FPGA 

The  primary  limitation  of  the  FPGA  when  used  in  DSP  applications  is  arithmetic  -  most  notably 
multiplication.  When  FPGAs  are  used  for  DSP  applications,  the  multiplier  circuits  must  be 
implemented  with  the  available  chip  resources.  However,  a  hardware  multiplier  is  a  reasonably 
complex  circuit,  as  evidenced  by  the  fact  that  conventional  DSP  microprocessors  contain  only  a 
single  hardware  multiplier,  and  it  occupies  most  of  the  real  estate  on  the  chip.  A  state-of-the-art 
FPGA  can  support  no  more  than  a  handful  of  multipliers,  meaning  that  brute  force  multiplication 
is  often  avoided  in  some  of  the  most  common  operations  -  e.g.,  filtering  or  correlation. 

When  implementing  multipliers  in  hardware,  two  basic  alternatives  are  available:  The 
fully  parallel  array  multiplier  and  the  fully  bit-serial  multiplier.  The  advantage  of  the  fully 
parallel  array  multiplier  is  that  all  of  the  product  bits  are  produced  at  once  which  generally  results 
in  a  faster  multiplication  rate.  The  multiplication  rate  for  this  adder  is  simply  the  delay  through 
the  combinational  logic.  However,  parallel  multipliers  also  require  a  large  amount  of  area  to 
implement.  Bit  serial  multipliers  on  the  other  hand  generally  require  only  1/Nth  the  area  of  an 
equivalent  parallel  multiplier  but  take  2N  bit  times  to  compute  the  entire  product  (N  is  number  of 
bits  of  multiplier  precision)  [6].  This  concept  is  illustrated  in  Figure  2.  A  new  trend  is  to 
incorporate  a  limited  number  of  hardware  multipliers  within  the  FPGA.  For  example,  AT&T 
incorporates  a  4  X  1  multiplier  in  each  programmable  function  unit  of  its  ORCA  FPGA  family. 

Innovative  techniques  which  avoid  conventional  multiplication  in  computing  FIR  filters 
and  other  DSP  algorithms  have  been  investigated  by  a  number  of  researchers.  For  example  use 
of  distributed  arithmetic  techniques  have  been  reported  [12],  which  makes  extensive  use  of  look¬ 
up  tables,  an  approach  which  allows  a  considerable  savings  in  chip  resources. 

The  FPGA  has  the  ability  to  implement  filtering  and  transforms  using  one  of  several 
distributed  arithmetic  techniques,  depending  on  the  performance  required.  These  techniques  can 
be  used  to  optimize  the  implementation  of  many  other  types  of  data  processing  or  MAC-based 
algorithms.  Parallel  distributed  arithmetic  techniques  are  used  to  achieve  the  fastest  sample  rates 
,  while  lower  rates  can  be  sustained  with  a  serial  or  serial  sequential  distributed  arithmetic 
techniques  that  uses  less  resomces  (fewer  arithmetic). 
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2.4.1  Distributed  Arithmetic 

Distributed  Arithmetic  are  computational  algorithms  that  perform  multiplications  with 
look  up  tables.  This  algorithm  is  generally  used  to  perform  important  DSP  filtering  and 
fi'equency  transforming  fimctions.  Since  most  of  the  recent  architectures  of  the  programmable 
logic  have  supported  the  look-up  table  methodology  distributed  arithmetic  has  become  very 
popular. 


(N  bits,  serial) 


(M  bits,  serial  or  parallel) 


i 


Multiplication  time  = 
N+M  bit  times 


(N+M  bits  serial) 


product 

Bit  Serial  Multiplier 


(M  bits,  parallel) 


(N  bits,  parallel)  • 


Multiplication  time  = 
prop,  delay  of  logic 


product 
Y  out 
(N+M  bits  parallel) 

Parallel  Multiplication  Array 


Figure  2  -  Block  Diagrams  of  Basic  Multiplier  Alternatives  [6] 

Distributed  Arithmetic  differs  from  conventional  arithmetic  only  in  the  order  in  which  it 
performs  operations.  Take  for  example  a  four-product  MAC  function  that  uses  a  conventional 
sequential  shift  and  add  technique  to  multiply  four  pairs  of  numbers  and  sum  results.  The  four- 
multiplication  are  performed  simultaneously  and  the  results  are  then  summed  when  the  products 
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are  complete.  This  method  of  implementation  requires  n-clock  cycles  for  data  sample  of  n-bits. 
Hence ,  the  processing  clock  rate  is  equal  to  data  rate  divided  by  the  number  of  data  bits.  During 
each  data  clock-cycle,  the  four-multipliers  simultaneously  create  four-product  terms,  that 
eventually  are  summed  into  the  output.  The  distributed  arithmetic  differs  from  this  process  by 
adding  the  partial-products  before,  rather  than  after,  the  bit-weighted  accumulation. 

By  using  Distributed  Arithmetic,  the  operations  are  reordered.  The  reordering  reduces 
the  number  of  shift-and-add  circuits  to  one,  but  does  not  change  the  number  of  simple  adders. 
Distributed  arithmetic  is  of  two  types  the  serial  and  parallel  distributed  arithmetic. 

Distributed  arithmetic  is  useful  in  filtering  applications,  where  the  coefficients  are 
constant.  Adders  and  AND  gates  are  made  use  of  to  implement  multiplication  with  coefficients. 
But  in  distributed  arithmetic  the  AND  functions  and  adders  are  replaced  with  look  up  tables 
(LUT).  If  a  single  bit  is  made  use  of  to  access  the  LUTs  then  it  is  called  serial  distributed 
arithmetic,  where  the  incoming  sample  of  the  signal  stored  in  a  shift  register  and  a  bit  at  a  time  is 
shifted  out.  The  other  type  of  distributed  arithmetic  is  Parallel  distributed  arithmetic  (PDA), 
where  the  number  of  bits  used  to  access  the  LUTs  are  more  than  one.  The  overall  performance 
of  PDA  is  better  than  SDA  as  in  the  former  case  the  number  of  bits  processed  during  each  clock 
cycle  is  increased. 

2.4.2  Pipelined  Architectures 

Since  the  FPGA  CLBs  contain  flip-flops  ,  they  are  used  for  storing  and  delaying  the 
signal.  The  purpose  of  pipelining  is  to  increase  the  speed  at  which  the  system  operates  by 
decreasing  the  delay  of  the  critical  path  of  the  system.  The  Figure  3  below  illustrates  the 
concept  of  pipelining. 

In  this  figure  we  have  two  clocked  circuits  which  are  driven  by  the  same  clock.  Hence 
the  speed  of  operation  depends  on  the  delay  that  separates  the  two  clocked  circuits.  If  the  delay 
is  more  than  the  clock  period,  then  the  frequency  of  operation  is  limited  by  the  delay  between 
them,  to  increase  the  frequency,  register  elements  can  be  include  in  the  delay  path  thereby 
decreasing  the  delay  in  the  path  and  increasing  the  speed  of  operation.  This  is  shown  in  part  two 
of  Figure  3.  Pipelining  is  a  trade  off  between  speed  versus  the  resource  utilization.  Using  too 
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many  pipelined  stages  in  the  system  results  in  a  enormous  amount  of  hardware  resources  and  in 
designs  where  area  and  power  are  of  main  concern,  then  pipelining  may  not  be  advisable. 


2.4.3  Design  of  Data  Paths 

The  data  path  is  that  module  of  the  system  that  performs  the  data  processing  operations. 
The  design  of  the  data  path  in  any  system  design  is  very  important.  The  speed  of  the  system 
increases  if  there  are  a  number  of  data  paths  performing  the  same  function  in  parallel  but  this 
kind  of  system  design  requires  a  lot  of  hardware  resources  and  hence  it  is  important  to  decide 
whether  or  not  the  data  path  will  be  shared. 


A  critical  factor  in  designing  the  data  path  is  the  precision  requirements  of  the  design.  If 
very  high  precision  is  required,  especially  in  DSP  applications,  then  the  data  path  should 
implement  a  floating  point  processing  of  the  signals.  However,  this  would  lead  to  excessive 
usage  of  resources  since  two  registers  are  required  to  store  a  floating  point  number  in  the  data 
path,  hence  all  the  designs  make  use  of  fixed  point  number  data  paths. 

Though  fixed  point  data  path  design  leads  to  using  less  hardware  resources,  the  accuracy 
of  the  results  obtained  are  less  when  compared  to  the  floating  point  designs.  This  factor  is 
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important  for  programmable  logic  as  there  are  limited  resources.  By  making  use  of  fixed  point 
algorithms,  a  considerable  number  of  resistor  elements  can  be  reduced,  which  in  tmn  can  be 
used  to  implement  parallel  data  paths  and  pipelining  to  improve  the  speed  at  which  the  design 
operates, 

2.4.4  Routing  Delays 

Routing  Delays  play  a  major  role  in  the  hardware  design  since  they  limit  the  operating 
speed  of  the  system.  Very  complex  designs  often  suffer  firom  routing  delays  because  of  the  large 
number  of  logic  circuits  implemented  in  the  device,  resulting  in  less  space  available  for 
achieving  optimal  routing.  This  is  a  concern  for  DSP  systems  as  many  arithmetic  operations 
need  to  be  performed.  Arithmetic  elements  are  required  which  typically  occupy  a  large 
percentage  of  the  available  resources,  which  affects  the  routing  of  the  design. 

The  recent  trend  in  FPGAs  is  to  use  common  function  (or  macro)  blocks  to  aid  in 
developing  systems  with  lower  routing  delays.  For  example  the  Xilinx  FPGAs  supports 
XBOLX  modules  such  as  adders,  subtracters,  incrementors,  decrementors  etc. ,  which  are  used 
extensively  to  implement  arithmetic  functions  on  Xilinx  FPGAs  with  very  low  routing  delays. 
Thus  by  decreasing  the  routing  delays  we  can  increase  the  fi'equency  of  operation  of  the  system. 

2.5  FPGA  Applications  in  Software  Radio  Systems 

The  DSP  Functions  that  FPGAs  do  best  are  those  requiring  high  sample  rates  and  short  word 
length.  They  are  especially  suited  for  FIR  filter  designs  employing  lots  of  filter  taps  and  fast 
correlators.  The  lookup  table  architecture  of  FPGAs  provides  a  fast  and  efficient  way  to  build 
correlators  [3].  More  taps  can  be  added  to  the  parallel  filter  with  only  a  small  performance 
tradeoff  with  additional  parallel  silicon  resources.  In  contrast,  DSP  processors  exhibit  a  linear 
decrease  in  performance  as  the  number  of  taps  increases  (Table  1).  An  8-tap,  8-bit  FIR  filter 
implemented  on  an  Altera  device  needs  only  80%  more  silicon  than  one  8  x  9  bit  fixed  multiplier 
(Table  2)[1]. 
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Table  1  - 

Fully  Parallel  8-bit  FIR  Filter  (FLEX  8000A  FPGA) 

#  of  Taps 

Performance  (MSPS) 

Equivalent  MIPS  (DSP  Processor) 

8 

104 

832 

16 

101 

1,616 

24 

103 

2,472 

32 

105 

3,360 

Table  2  -  Silicon  Resource  Comparison 

Function 

Inputs  &  Outputs 

Flex  8000A  Logic  Cells 

FIR  Filter 

8-bit  data,  coeff 
17-bit  output 

296 

Fixed  Point  Multiplier 

8  X  9  bit  data 

17  bit  output 

164 

Table  3  shows  the  performance  of  multipliers  implemented  on  the  Xilinx  4000  family.  Note  that 
parallel  multipliers  require  a  larger  proportion  of  the  device,  while  bit  serial  implementations  are 
slower.  The  first  number  in  the  Multiplier  Speed  column  for  the  bit-serial  multipliers  is  the  clock 
speed,  while  the  second  number  is  the  multiplier  speed. 


Table  3  -  Xilinx  4000  Series  FPGA  Multipliers 

Type  of  Multiplier 

#CLBs 

%  of  FPGA 

Mult.  Speed 

8  bit  unsigned  (parallel) 

64 

16% 

8.54  MHz 

16  bit  unsigned  (parallel) 

242 

60% 

3.8  MHz 

8  bits  unsigned  (bit-serial) 

17 

4% 

73.1/4.6  MHz 

16  bit  unsigned  (bit-serial) 

33 

8% 

62/1.9  MHz 

FPGAs  can  efficiently  implement  IIR  filters.  For  example,  a  lookup  table  based  vector 
multiplier  can  be  used  to  create  a  complete  second  order  section  of  an  all  pole  analog  filter.  The 
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vector  multiplier  requires  the  same  resources  and  operates  at  the  same  speed  as  a  fixed  point 
multiplier.  A  Butterworth  filter  can  run  at  a  rate  of  25  Msps  and  require  only  139  logic  cells  [1]. 

Altera  has  developed  high  speed  FIR  filter  megafimctions  that  are  optimized  for  their 
own  FPGA  structure.  These  filters  can  be  implemented  in  parallel  or  serial  form  allowing  a 
tradeoff  between  silicon  resources  and  performance.  Parallel  filters  can  perform  at  rates  up  to 
100  Msps  enabling  digital  processing  of  RF-IF  data.  Serial  filters  require  less  logic  and  still 
perform  at  5  to  6  Msps.  In  a  Spread  Spectrum  RF  modem  application,  an  Altera  FPGA  can 
implement  the  receiver's  correlation  filter  fimction  at  a  chip  rate  over  60  MHz.  A  DSP  processor 
can  perform  the  remaining  tasks,  such  as  quadrature  phase  shift  key  (QPSK)  demodulation.  The 
resulting  DSP  application  can  deliver  six  times  the  data  rate  as  the  DSP  processor  alone. 

A  typical  DSP  algorithm  contains  many  feed-back  loops  or  parallel  structures  .  The 
software  code  for  a  DSP  algorithm  of  this  type  is  not  efficiently  implemented  in  general  purpose 
DSP  Typically  ,  about  10-30  percent  of  the  DSP  code  utilizes  60-80  percent  of  the  processors 
power.  Analyzing  the  DSP  algorithm  and  breaking  out  any  parallel  structures  or  repetitive  loops 
into  multiple  data  paths ,  one  can  enhance  the  overall  performance  of  the  algorithm.  To  increase 
the  speed  and  resource  utilization,  the  multiple  parallel  data  structures  can  be  processed  either 
through  parallel  DSP  devices  or  in  a  single  FPGA-based  DSP  hardware  accelerator  with  or 
without  the  assistance  of  a  DSP  device. 

The  use  of  FPGAs  are  well  suited  for  many  DSP  algorithms  and  fimctional  routines.  The 
FPGA  can  be  programmed  to  perform  any  niunber  of  parallel  paths.  Hence  by  implementing  the 
algorithm  in  parallel  paths  the  speed  at  which  the  algorithm  can  work  on  a  piece  of  hardware 
increases,  thus  one  very  important  use  of  FPGAs  in  the  field  of  DSPs  is  improving  the  speed  of 
operation.  The  operational  data  paths  can  consist  of  any  combination  of  simple  and  complex 
fimctions,  such  as  Adders,  Barrel  Shifters,  Counters,  Multiply  and  Accumulation  ,  Comparators 
and  Correlates  just  to  mention  a  few.  The  FPGA  can  also  be  partially  or  completely 
reconfigured  in  the  system  for  a  modified  or  completely  different  algorithm. 

The  primary  concept  is  to  unload  the  compute-intensive  fimctions  requiring  multiple 
DSP  clock  cycles  into  the  FPGA  and  allow  the  DSP  processor  to  concentrate  on  optimized 
single-clock  fimctions.  The  combined  fimctionality  of  the  FPGA  and  general-purpose  DSP  can 
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support  several  magnitudes  higher  data  throughput  than  two  or  more  parallel  DSP  devices.  The 
FPGA/DSP  implementation  is  more  flexible  and  proves  to  be  more  cost  effective  than  multiple 
DSPs  or  an  ASIC. 

3.  Rapid  Prototyping  Concepts 

Designing  with  FPGAs  requires  computer  assistance  at  almost  every  stage  of  the  design 
including  detailed  specification,  simulation,  placement  and  routing.  The  use  of  schematic 
capture  based  CAD  tools  is  a  common  approach  to  the  design  of  custom  logic  devices  using 
FPGAs.  This  process  is  often  combined  with  logic  level  simulation  to  verify  a  specific  design. 
One  method  of  increasing  the  range  of  architectural  solutions  that  a  designer  may  explore  in  a 
reasonable  time  is  to  specify  the  DSP  system  with  a  hardware  description  language  (HDL)  [10]. 
This  steps  the  design  process  up  one  level  and  allows  a  generic  functional  description  of  the 
target  system  which  can  be  further  simulated  or  implemented  directly  onto  an  FPGA  after  the 
HDL  code  is  converted  using  the  FPGA  manufacturers  software. 

In  DSP  applications,  arithmetic  circuitry  for  operations  such  as  addition,  subtraction  and 
multiplication  are  commonly  required.  These  arithmetic  circuits  can  be  designed  and 
implemented  by  employing  user-generated  or  manufacturer-provided  sub-circuits,  which  can  be 
reused.  However,  as  these  designs  can  only  be  simulated  at  the  logic  gate  level,  it  is  difficult  to 
verify  the  functional  performance  of  the  algorithms  being  implemented.  It  is  particularly 
difficult  to  determine  the  potential  undesirable  side  effects  of  finite  precision  arithmetic,  as  this 
may  require  that  large  data  sets  be  simulated  and  translated  from  numerical  values  to  logic  levels 
and  vice  versa  [10].  However,  new  software  tools  are  being  developed  which  raise  the  design 
process  to  yet  another  level,  allowing  the  designer  to  begin  at  the  system  level. 

Simulation  tools  such  as  Cadence's  Signal  Processing  Worksystem  (SPW)  now  have 
features  which  allow  the  engineer  to  design  hardware  logic  systems  and  DSP  fixed  point  systems 
using  the  traditional  block  diagram  functional  description  of  the  circuit.  This  design  is  then 
immediately  converted  into  a  hardware  description  language.  Other  SPW  tools  allow  the  design 
to  be  simulated  via  the  HDL  description  of  the  system  and  then  linked  into  a  manufacturers 
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software  tools  which  support  specific  devices.  Most  manufacturers,  in  the  interest  of  making 
their  product  more  attractive  to  their  customers,  have  developed  a  set  of  stock  logic  elements 
which  can  be  reused  within  their  device  to  assist  the  engineer  in  quickly  achieving  any  design. 

Once  suitable  design  tools  and  automatic  methods  are  perfected,  designers  and 
programmers  will  be  able  to  create  custom  hardware  circuitry  and  pipelines  to  suit  the  problem  at 
hand  -  the  term  'soft  hardware'  suggests  that  hardware  will  become  as  readily  created  and 
malleable  as  software.  In  a  practical  sense  this  will  mean  that  the  tum-around  time  for  custom 
hardware  will  be  just  as  short  as  software  development  is  today. 

3.1  Design  Flow 

Figure  3  below  gives  the  flow  of  rapid  prototyping.  The  flow  of  the  design  is  fi’om  the 
fimctional  description  of  the  system  to  hardware  implementation.  The  fimctional  description  of 
the  system  is  done  in  either  SPW(Signal  Processing  WorkSystem)  or  in  VHDL.  The  fimctional 
description  is  usually  is  at  a  high  level  i.e.,  basically  proper  fimctioning  of  the  algorithm  is  given 
importance.  A  reconfigurable  flat-form  is  made  use  of  to  aid  the  flow  of  rapid  prototyping.  To 
integrate  the  above  methodology  commercial  CAD  tools  are  used. 

The  system  to  be  implemented  is  fimctionally  described  using  the  Hardware  Design 
System  (HDS)  of  SPW.  The  algorithm  is  designed  using  the  blocks  available  in  HDS.  The 
fimctional  description  of  the  system  is  done  using  the  fixed  point  blocks  to  limit  the  use  of 
hardware  resources  which  are  limited  in  reconfigurable  hardware.  The  algorithm  developed  is 
simulated  in  SPW  environment  before  the  implementation  is  done  to  make  sure  that  the 
fimctional  description  of  the  system  is  correct.  To  implement  the  design  on  hardware  the  HDS 
section  of  SPW  provides  a  link  which  generates  VHDL  code  for  the  system  designed  in  HDS. 
The  generated  VHDL  code  is  used  for  synthesis  to  implement  the  system  on  the  targeted  device. 

3.1.1  Functional  Description  Using  SPW  and  VHDL 

The  other  way  to  fimctionally  describe  the  whole  system  is  in  VHDL.  The  advantage  of  using 
hand-coded  VHDL,  rather  than  the  VHDL  code  generated  by  the  schematic  tool  is,  for  very 
large  and  complex  designs,  the  schematic  capture  of  the  system  becomes  difficult  and 
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impractical.  Also,  the  hand-coded  VHDL  is  very  flexible  in  the  sense  that,  if  certain  enable 
signals  are  required  for  flip-flop,  counters  etc,  then  the  blocks  provided  by  the  HDS  need  to  be 
modified  and  then  used  in  the  system  design.  However,  in  the  case  of  hand-coded  VHDL  we 
can  describe  the  above  components  easily.  Also  the  generated  code  fi-om  a  schematic  tool  tends 
to  require  more  hardware  than  hand-coded  VHDL. 


Figure  4  -  Block  Diagram  of  Rapid  Prototyping  Flow 


3.1.2  Simulation  and  Synthesis 

The  next  step  in  the  design  flow  is  to  simulate  the  design  using  standard  tools,  that  verify 
the  whether  or  not  the  design  is  functionally  right.  If  the  simulation  results  are  satisfactory  then 
the  design  is  synthesized  using  standard  tools  that  target  the  design  to  FPGAs.  The  systems 
designed  using  SPW  ,  can  be  simulated  in  the  SPW  environment.  But  the  hand-coded  VHDL  , 
need  to  be  compiled  and  simulated  using  standard  simulation  tools  of  Mentor,  Synopsys,  etc. 
The  figure  3.2  below  shows  the  flow  followed  dming  the  simulation  of  the  system. 

Synthesis  is  the  procedure  that  makes  possible  the  implementation  of  the  system  on  the 
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targeted  hardware.  It  is  also  one  of  the  key  factors  which  aids  the  rapid  prototyping.  Synthesis 
tools  help  in  translating  the  high  level  design  into  gate  and  register  level  which  the  routing 
software  understands.  The  synthesis  tools  generate  netlist  files  that  are  used  by  the  routing 
software  to  generate  files  that  are  used  to  develop  the  hardware  physically.  For  example  the 
FPGA  Synopsys  compiler  generates  a  top  level  netlist  for  the  design  which  is  used  by  the  Xilinx 
software,  which  partitions,  places  and  routes  the  design.  Figure  6  below  gives  the  flow  of 
synthesis  procedure  followed  for  synthesizing  a  given  system. 

High  level  system  design  is  gaining  popularity  as  it  allows  the  designers  to  describe 
systems  at  a  high  level  using  schematic  capture,  VHDL  and  Verilog  deign.  The  high-level 
design  methodology  reduces  hbrary  and  technology  dependence,  enabling  re-targeting  to  other 
libraries,  such  as  an  FPGA  library,  with  greater  ease. 


THE  FLOW  OF  SIMULATION  OF  SYSTEM  DESIGN 


Figure  5  -  Block  Diagram  of  Simulation  flow 
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Figure  6  -  Block  Diagram  of  Synthesis  Procedure 

3.1.3  Implementation 

The  implementation  occurs  at  two  levels:  first  is  at  chip  level  where  the  entire  system  is 
partitioned  into  smaller  submodules  and  these  modules  are  implemented  on  the  FPGAs.  The 
second  level  of  implementation  is  at  the  system  level,  this  is  where  all  submodules  are  integrated 
and  the  entire  system  is  tested. 

Chip  level  implementation  which  is  accomplished  on  FPGAs  is  an  important  part  of  the 
rapid  prototyping  flow.  Synthesis  tools  provide  the  designers  with  netlist  files  of  the 
submodules  to  be  implemented.  During  the  synthesis,  constraints  are  provided  to  meet  the 
specifications.  The  reconfigurable  nature  of  FPGAs  aid  the  rapid  prototyping  methodology. 
The  design  of  the  modules  are  mapped  on  to  the  FPGAs  using  the  concept  of  partition, 
placement  and  routing.  Every  FPGA  has  its  own  placement  and  routing  software  to  map  the 
design.  The  software  partitions  the  design  and  places  the  logic  into  the  configurable  logic  blocks 
and  finally  does  the  routing  of  the  entire  design.  The  software  generates  a  bit  file  to  configure 
the  FPGA  device. 
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System  level  design  implementation  is  done  using  the  rapid  prototyping  board,  known  as 
the  field  programmable  circuit  board  (FPCB).  The  FPIC  (filed  programmable  interconnect 
component),  are  programmable  interconnect  components  which  form  the  core  of  the 
programmable  circuit  board.  For  example,  the  Aptix  MP3  reconfigurable  board  has  three 
programmable  interconnect  components  used  for  routing  purposes.  The  MP3  board  also 
supports  diagnostic  programmable  interconnect  components  which  aid  in  viewing  signals  on  the 
diagnostic  instruments.  The  FPIC  is  configured  through  a  Host  Interface  Module  (HIM)  ,  which 
transfers  data  from  a  workstation  to  program  the  FPIC.  A  Stand-alone  Program  Module  (SPM) 
can  be  utilized  to  perform  the  same  function  without  a  workstation. 

The  FPCB  provides  fully  automated  downloading  of  configuration  data  to  both  FPGAs 
and  FPIC  devices.  As  the  board  supports  the  combination  of  FPGAs  with  standard  com- 
ponents(memory,  DSP  and  microcontrollers)  makes  the  MP3  uniquely  suited  for  DSP  system 
prototyping. 

3.1.4  Verification 

The  rapid  prototyping  environment  helps  in  debugging  and  verifying  very  complex 
systems.  As  stated  earlier,  the  FPIC  devices  used  on  the  FPCB  are  of  two  types:  one  is  used  for 
routing  purposes  and  is  designated  as  an  FPIC(R)  device  .  The  other  type  of  device  is  the 
diagnostic  device,  which  is  used  for  probing,  debugging  and  verifying  signals  of  the  design  and 
designated  as  FPIC(D).  These  FPIC(D)  devices  can  be  connected  to  the  logic  analyzer  with  help 
of  diagnostic  pads.  The  software  for  the  board  is  called  AXESS,  and  it  programs  both  the 
diagnostic  FPIC  devices  and  logic  analyzer. 

This  setup  provides  a  very  powerful  debugging  capability,  since  each  signal  that  appears 
in  the  system  level  netlist  can  be  routed  through  one  or  more  FPIC(D)  devices  and  viewed  on  the 
logic  analyzer.  The  signals  to  be  viewed  are  selected  with  the  help  of  diagnostic  device  interface 
provided  by  the  software.  The  software  automatically  programs  the  diagnostic  FPIC  device  to 
display  the  selected  signals  on  the  logic  analyzer.  At  the  same  time,  the  diagnostic  interface 
facility  configures  the  logic  analyzer.  The  diagnostic  interface  provided  by  the  reconfigurable 
board  software  does  the  channel  assignment  and  labeling  of  the  waveform  displays. 
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3.2  Prototyping 

In  traditional  prototyping  approaches,  the  design  is  mapped  to  a  technology  that  allows  speeds 
such  that  all  interfaces  to  targeted  applications  can  operate  in  real  time.  But  rapid  prototyping  , 
provides  flexibility  for  system  emulation  technology  to  explore  architectural  and  implementation 
alternatives  available  for  achieving  the  desired  system  function. 

Prototyping  was  commonly  done  using  custom  printed  circuit  boards  and  wire  wrap 
technologies  until  the  design  complexity  became  too  large  to  make  these  approaches  feasible. 
The  new  technologies  such  as  FPIC,  FPCB,  and  FPGA  have  created  a  new  path  that  enables 
mapping  of  complex  logic  into  programmable  hardware  which  can  meet  the  real-time  operating 
frequencies  of  DSP  applications.  The  main  aim  of  rapid  prototyping  is  to  design,  implement  and 
verify  systems  quickly,  hence  aiding  in  bringing  products  faster  to  the  market  when  compared  to 
traditional  prototyping  methods. 


4.  Methodology:  12  Tap  FIR  Filter  using  LUT  Techniques 

Finite  Impulse  Response  (FIR)  filters  play  an  important  role  in  the  design  of  practical  discrete¬ 
time  systems.  At  the  heart  of  a  FIR  filter  lies  the  multiplication  function,  which  introduces  the 
coefficients  of  the  filter  in  the  design.  Each  filter  tap  has  its  own  multiplier,  which  gives  the 
product  of  the  input  data  with  the  coefficient.  When  implementing  a  FIR  filter  on  an  FPGA,  the 
multiplication  function  imposes  a  bottleneck  on  the  speed  performance  and  area  requirements  of 
the  design,  therefore  the  designer  should  focus  on  enhancing  the  performance  of  these 
multipliers  and  hence  of  the  whole  design.  Various  multiplication  techniques  include  Shifl-and- 
Add,  Adder  Tree,  Logical  Tree,  multiplication  by  a  power-of-two,  and  constant  coefficient 
multiplier  using  Look-Up-Tables  (LUT).  The  last  is  the  key  to  high  performance  in  FIR  filters 
with  fixed  coefficients.  This  report  describes  the  design  of  a  Low  Pass  12-tap  FIR  filter  using 
constant  coefficient  multipliers  implemented  on  a  Xilinx  4013  FPGA  using  the  Aptix  MP3 
prototyping  board. 
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4.1  Background 

FIR  filters  are  very  useful  in  DSP  applications  because  they  are  inherently  stable,  can  be 
employed  in  a  non-recursive  structure,  and  exhibit  linear  phase  characteristics  in  the  passband. 
The  general  Direct  Form  structure  of  an  FIR  filter  is  shown  in  Figure  7  below. 


Figure  7  -  Direct  Form  structure  of  an  FIR  filter 

The  algorithmic  form  of  the  Linear  Constant  Coefficient  Difference  Equation  (LCCDE)  which 
describes  the  system  is  given  by: 

y(n)  =  hoX(n)  +  h,x(n-l)  +  h2x(n-2)  + - +  h„.,x(n-m) 

For  linear  phase  response  of  the  filter,  the  impulse  response  must  satisfy  the  symmetry 
condition: 

h[M-n]  =  h[n]  for  n=0, 1 ,20.  .M 

The  general  Direct  Form  structure  shown  in  Figure  7  exhibits  excessive  redrmdant  hardware  and 
poor  timing  characteristics  when  implemented  on  hardware.  An  alternative  inverted  structure 
implementation  is  shown  in  Figure  8  below. 
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Figure  8:  Inverse  FIR  structure 


As  shown  in  the  inverse  structure,  the  data  samples  are  applied  to  all  the  tap  multipliers  the 
same  time,  therefore  processing  of  the  data  samples  is  done  in  parallel  and  hence  the  overall 
timing  performance  is  enhanced.  Also,  by  exploiting  the  symmetric  nature  of  the  coefficients, 
we  can  reduce  the  number  of  multipliers  needed  by  half  Moreover,  if  we  use  Look-Up-Tables 
instead  of  regular  multipliers,  the  time  delay  incorporated  in  the  multiplication  function  is 
dramatically  reduced.  Each  Look-Up-Table  in  each  Tap  contains  all  the  possible  products 
obtained  when  we  multiply  the  specific  tap  coefficient  with  the  incoming  data.  Therefore,  the 
data  bits  are  just  appHed  on  the  address  input  of  the  LUT  (which  is  basically  a  ROM)  and  the 
corresponding  "data  X  coefficient"  product  is  obtained  automatically  on  the  output  of  the  LUT. 

4.2  Design  with  SPW 

The  Signal  Processing  Worksystem  (SPW)  tool  provides  the  means  for  designing  the  system 
schematically  and  for  verifying  and  simulating  the  design.  SPW  is  a  powerful  block  oriented 
software  tool  suitable  for  designing  any  kind  of  DSP  systems.  The  Filter  Design  System  (FDS), 
which  is  a  part  of  SPW,  was  used  to  obtain  the  filter's  coefficients.  First,  the  filter's  fi-equency 
characteristics  (Low  Pass,  cutoff  frequency  etc)  were  given  as  input  to  FDS,  which  in  turn 
calculates  the  coefficients  and  the  number  of  taps  needed  to  meet  the  desired  specifications. 
Then,  the  Block  Design  Editor  (BDE),  which  is  another  subsystem  tool  of  SPW,  was  used  to 
design  the  filter  schematically  using  standard  DSP  blocks  like  adders,  multipliers,  delay 
elements  etc. 
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All  these  blocks  are  located  in  the  Hardware  Design  System  (HDS)  library  of  SPW, 
which  allows  the  use  of  fixed-point  arithmetic  in  the  design.  The  advantage  of  using  fixed-point 
arithmetic  is  that  we  can  accurately  model  the  real  behavior  of  the  digital  system  because  we 
don't  need  to  deal  with  loss  of  precision  when  using  floating  point  arithmetic  in  a  bit-limited 
digital  system.  After  the  system  is  designed  schematically,  the  Signal  Calculator  System  of  SPW 
is  invoked  to  simulate  the  operation  of  the  design.  The  Signal  Calculator  is  capable  also  of 
generating  fixed-point  signals  which  can  be  applied  to  the  design  and  verify  its  real  performance. 

4.3  Filter  Design 

The  FIR  filter  described  in  this  report  has  the  following  characteristics: 

Type:  Low  Pass  FIR 
Tap  length:  12 

Cutoff  fi'eauencv:  fc=0.1Fs  (Fs=sampling  frequency) 

Stopband  edge:  0.13Fs 
Stopband  Attenuation :  30  dB 
Filter  Method:  Equiripple/Low  Pass 
Input  Data  width:  8  bits 
Output  Data  width:  12  bits 
Coefficients:  8  bits 

Using  the  Filter  Design  System  (FDS)  which  is  a  part  of  the  SPW  design  tool,  the  coefficients  of 
the  filter  were  obtained  (in  Double  precision  format) 

bo  =  b„  =  0.040473 
b,=b,o  =  0.075372 
b2  =  b,  =  0.11826 
b3  =  bg  =  0.1 6903 
64  =  67  =  0.20653 
65=60  =  0.22994 
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Figure  9  below  illustrates  the  block  diagram  of  the  design.  As  shown,  the  design  is  implemented 
as  a  parallel  inverse  structure  and  only  6  Look-Up-Table  (LUT)  blocks  are  used  because  the  12 
coefficients  are  symmetric. 


rata  Input  (tMts) 


As  discussed  before,  the  LUT  blocks  are  used  instead  of  regular  multipliers.  The  internal 
structure  of  each  LUT  block  is  shown  in  Figxire  10  below.  The  incoming  8-bit  data  is  split  into 
two  segments  of  4-bits  each.  Each  4-bit  segment  is  used  to  address  a  ROM  Look-Up-Table.  So, 
each  LUT  block  shown  in  Figure  8  it  actually  contains  two  ROM  LUTs.  We  could  have  had 
only  one  ROM  LUT  and  apply  all  the  eight  bits  of  the  data  on  it,  but  this  would  be  space 
consuming  on  the  FPGA  because  it  would  need  a  ROM  with  2*  =  256  memory  locations.  By 
splitting  the  data  in  two,  each  ROM  has  now  2‘‘  =  16  memory  locations. 

The  upper  (Most  Significant)  LUT  contains  all  the  partial  products  of  the  8-bit 
coefficient  times  the  most  significant  4-bits  of  the  data  (i.e  16  partial  products).  Similarly,  the 
lower  (Least  Significant)  LUT  contains  all  the  possible  partial  products  of  the  8-bit  coefficient 
times  the  least  significant  4-bits  of  the  data.  Therefore,  at  the  output  of  each  LUT  we  have  a  12 
bit  partial  product  (4  bits  data  +  8  bits  coefficient).  Each  12  bit  partial  product  is  appropriately 
zero  padded  and  then  Summed  to  produce  the  final  product  at  the  output  of  the  Tap. 


Figure  10  -  Internal  Structure  of  LUX 


4.4  Results 

4.4.1  Filter  Performance 

The  design  was  implemented  on  a  Xilinx  4013  10  FPGA  on  the  Aptix  MP3  protot5^ing 
board.  Using  the  Synopsys  and  Xilinx  tools,  the  FPGA  area  used  and  the  max  time  delay  of  the 
filter  were  obtained  as  shown  on  Table  4  below.  Table  5  is  a  comparison  of  the  filter  described 
in  this  report  with  two  other  filters  designed  before. 


Table  4  -  Area  and  timing  performance  results 

Total  number  of  CLBs  used 

302 

%  space  of  4013  FPGA  used 

52% 

Max.  data  arrival  time 

51.65  nsec 

(=max  delay) 

Max.  filter  clock  speed 

20  MHz 
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Table  5  -  Comparison  of  Area  &  Timing  Performance  of  3  Filter  designs 

Filter  Type 

Total  #  of  CLB's 

Max.  Data  Delay 

Max.filter  clock 

FIR  8-tap  with 

regular  multipliers 

140  (24%) 

87.8  nsec 

11.4MHz 

FIR  12-tap  with 

regular  multipliers 

329(57%) 

130.36  nsec 

7.5  MHz 

FIRl  2-tap  with 

LUT  constant 

coefficient 

multipliers 

302(52%) 

51.65  nsec 

20  MHz 

As  shown  from  Table  5  above,  the  LUT  based  FIR  filter  has  much  better  speed  performance 
compared  with  the  two  other  filters  which  use  regular  multipliers  in  the  design.  Even  the  8-tap 
filter,  which  uses  much  less  space  than  the  12-tap  LUT  based  filter,  has  bigger  time  delay  in  the 
Data  path  compared  with  the  12-tap  LUT  based  design. 

5.  Results  of  the  Implementation 

Figure  1 1  below  shows  the  theoretical  frequency  response  of  the  filter  obtained  using  Double 
Precision  arithmetic  for  the  coefficients.  On  the  other  hand.  Figure  6  illustrates  the  experimental 
frequency  response  with  fixed  point  coefficients.  From  Figure  5  we  observe  that  there  is  a  steep 
roll-off  at  the  cut  off  frequency  (O.lFs).  At  this  cut  off  point  we  can  see  that  the  magnitude  of  the 
response  falls  by  -3dB  from  the  maximum.  This  graph  was  obtained  using  Double  Precision 
arithmetic  for  the  coefficients  which  means  that  it  gives  us  the  desired  frequency  response  with 
the  characteristics  given  from  the  Filter  Design  System  tool.  In  Figure  12  however,  we  observe 
that  the  real  frequency  response  of  the  filter  differs  from  the  desired  theoretical  response  of 
Figure  10.  The  -3dB  point  on  the  experimental  response  occurs  at  a  normalized  frequency  of 
O.OSFs . 
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As  shown  on  Figure  12,  the  -3dB  point  occurs  at  around  0.08  Fs  and  not  at  O.lFs  which 
is  the  design  specification.  This  happens  mainly  because  of  the  quantization  error  introduced  on 
the  coefficients.  According  to  our  design,  the  coefficients  of  the  filter  are  represented  by  an  8-bit 
fixed  point  binary  number.  Therefore,  when  the  Double  Precision  coefficient  number  is 
represented  by  the  8-bit  fixed  point  number,  there  is  loss  of  precision  because  of  quantization. 
This  error  can  change  the  filter's  characteristics,  and  especially  the  firequency  response  and  cut 
off  point,  because  the  original  value  of  the  coefficients  changes  after  quantization.  For  example, 
coefficient  bo  has  a  value  of  bo=0.040473.  When  this  number  is  represented  with  8-bit  fixed 
point  (two's  compliment)  arithmetic,  it  becomes  0.0390625  which  is  the  closest  approximation 
to  the  original  number.  Other  than  that,  the  fi'equency  response  of  the  filter  shows  good  rejection 
characteristics  and  meets  the  -30dB  rejection  in  the  stopband  as  specified  in  the  design. 
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Figure  12  -  Experimental  Frequency  Response  (fixed-point  coefficients) 


6.  Conclusions  and  Recommendations 

This  report  has  briefly  examined  the  emerging  field  of  FPGAs  and  their  application  to  traditional 
DSP  problems.  We  compared  the  computational  power  of  the  FPGA  to  that  of  the  DSP 
processor,  and  we  found  that  FPGAs  are  more  suited  to  simple  algorithms  executed  at  high 
speed,  while  DSP  microprocessors  are  better  suited  for  slower,  more  complex  tasks.  DSP 
microprocessors  are  limited  by  their  serial-based  architecture,  while  FPGAs  can  implement  any 
architecture  which  may  be  optimum  for  a  particular  algorithm.  On  the  other  hand,  we  recognized 
the  shortcomings  of  the  FPGA  -  these  devices  are  essentially  still  in  their  infancy,  from  a 
technological  standpoint,  and  there  are  problems  to  be  solved  by  the  manufacturers  before 
FPGAs  seriously  challenge  DSP  microprocessors  in  the  DSP  market. 

Designing  with  FPGAs  is  still  a  difficult  task.  The  software  necessary  for  efficient  FPGA 
design  is  extremely  expensive  and  rare.  Often,  a  considerable  amoimt  of  gate-level  modification 
is  necessary  on  the  part  of  the  engineer  to  realize  the  performance  that  the  FPGA  is  capable  of 
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delivering.  The  current  cost  of  FPGAs  is  an  indication  of  their  novelty  in  the  marketplace. 
Prices  Avill  need  to  drop  by  several  orders  of  magnitude  before  FPGAs  will  be  a  common 
commodity  in  the  DSP  designers  bag  of  tricks. 

We  also  presented  the  design  of  a  Low  Pass  12-tap  FIR  filter  using  Constant  Coefficient 
Multipliers  implemented  in  Look-Up-Tables.  Compared  with  filter  designs  which  use  regular 
multipliers,  this  filter  exhibited  much  better  speed  performance  and  area  occupation  on  the 
FPGA.  A  maximum  delay  of  51.65nsec  was  obtained  which  allows  the  filter  to  operate  on  clock 
speed  of  20  MHz.  The  firequency  response  of  the  filter  shows  good  rejection  characteristics  ( -30 
dB  in  the  stop  band),  but  due  to  quantization  error  introduced  on  the  coefficients,  the  cut  off 
fi'equency  is  shifted  firom  O.lFs  to  O.OSFs. 

This  design  occupies  52%  of  a  Xilinx  4013  FPGA,  which  means  that  there  is  still  enough 
space  on  the  FPGA  to  expand  the  design  with  more  taps.  The  benefit  of  this  will  be  better 
fi'equency  response  characteristics  with  a  trade  off  on  delay  increase  since  the  data  will  have  to 
travel  in  longer  paths.  Investigation  of  this  expansion  is  plaimed  for  the  future.  Also,  future  work 
includes  investigation  of  some  other  design  techniques  for  delay  reduction  (pipelining  for 
example).  Other  types  of  filters  are  also  under  investigation  (High  Pass,  Band  Pass  etc). 
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ABSTRACT 

The  wavelet  transform  decomposes  a  signal  or  image  into  components  on  the  basis  of  position  as 
well  as  scale.  Self-similarity  across  scales  is  an  essential  attribute  of  fractal  properties  of  images.  Also,  a 
scale  analysis  of  projections  of  objects  provides  feature  vectors  that  are  invariant  to  scaling  and  rotation. 
Both  fractal  analysis  and  scale  analysis  of  projections  have  been  found  to  possess  good  discriminatory 
properties  in  the  isolation,  detection  and  classification  of  concealed  objects  in  images.  The  wavelet 
transform  provides  a  natural  framework  for  the  implementation  of  these  analyses.  In  fact,  the  only  known 
wavelet  matching  techniques  for  optimum,  matched  filtered  pattern  recognition  are  for  bandlimiled 
wavelets.  Accordingly,  this  research  effort  investigated  the  development  of  techniques  for  fast 
implementation  of  bandlimited  wavelets  and  the  development  of  a  systems  framework  for  scale  analysis  of 
discrete  signals  and  images.  The  investigation  has  led  to  the  development  of  perfect  reconstruction  circular 
convolution  filter  bank  structures,  construction  of  continuous-dilation,  linear,  scale-invariant,  discrete-time 
self-similar  signals  and  systems,  and  formulation  of  the  scaling  mixture  problem  for  multichannel  angle  and 
doppler  estimation  for  wideband  signals. 
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WAVELET  TRANSFORM  TECHNIQUES  FOR  ISOLATION, 
DETECTION  &  CLASSIFICATION  OF  CONCEALED 

OBJECTS  IN  IMAGES 


Raghuveer  M.  Rao 


1  Introduction 

The  wavelet  transform  decomposes  a  signal  or  image  into  components  on  the  basis  of  position  as  well  as 
scale.  Self-similarity  across  scales  is  an  essential  attribute  of  fractal  properties  of  images.  Also,  a  scale 
etnalysis  of  projections  of  objects  provides  feature  vectors  that  are  invariant  to  scaling  and  rotation. 
Both  fractal  analysis  and  scale  analysis  of  projections  have  been  found  to  possess  good  discriminatory 
properties  in  the  isolation,  detection  and  classification  of  concealed  objects  in  images.  The  wavelet 
transform  provides  a  natural  framework  for  the  implementation  of  these  analyses.  In  fact,  the  only 
known  wavelet  matching  techniques  for  optimum,  matched  filtered  pattern  recognition  are  for  bandlim- 
ited  wavelets  [4].  Accordingly,  this  research  effort  investigated  the  development  of  techniques  for  fast 
implementation  of  bandlimited  wavelets  and  the  development  of  a  systems  framework  for  scale  analysis 
of  discrete  signals  and  images.  The  investigation  has  led  to  the  development  of  perfect  reconstruction 
circular  convolution  filter  bank  structures,  construction  of  continuous-dilation,  linear,  scale-invariant, 
discrete-time  self-similar  signals  and  systems,  and  formulation  of  the  scaling  mixture  problem  for  mul¬ 
tichannel  angle  and  doppler  estimation  for  wideband  signals  and  the  results  are  presented  under  the 
appropriate  headings. 


2  Perfect  Reconstruction  Circular  Convolution  (PRCC)  Filter 
Banks 

Our  research  into  fast  implementation  structures  for  bandlimited  wavelets  has  led  to  what  we  call  perfect 
reconsiruciton  circular  convolution  (PRCC)  filter  banks  [1,  2,  3,  8].  These  filter  banks  are  designed 
entirely  in  the  discrete  Fourier  transform  (DFT)  domain  and  satisfy  perfect  reconstruction  properties 
in  the  discrete  frequency  domain.  The  formulation  of  these  filter  banks  is  made  on  the  assumption  of  a 
finite  length  input  sequence  of  length  N.  In  addition,  all  the  operations  associated  with  the  filter  bank 
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implementation,  namely,  filtering,  downsampling  and  upsampling  can  be  implemented  entirely  in  the 
discrete  frequency  domain.  To  see  how  this  can  be  done,  we  first  interpret  the  processes  of  downsampling 
and  upsampling  in  terms  of  the  DFT.  Note  that,  during  this  discussion  we  use  the  notations  X{k)  and 
interchangeably  to  denote  the  DFT  of  an  A’-length  sequence  ar(n). 


2.1  Downsampling 

Consider  Figure  1  which  shows  an  M-fold  downsampler.  The  input  to  an  M  fold  downsampler  is  a 
sequence  z{n)  of  N  samples.  The  output  of  the  downsampler,  denoted  by  y(n),  is  given  by  [7] 

y(n)  =  x{Mn)  (1) 

where  the  sequence  y{n)  has  N/M  samples.  It  is  assumed  that  N  is  an  integer  multiple  of  M. 

To  obtain  an  expression  for  the  DFT  of  the  output  sequence  y(n),  we  first  define  an  A^«length 
intermediate  sequence 

f  n=multiple  of  M 


xi(n)  = 


0  otherwise 


Clearly 


Now  the  DFT  of  y{n)  is 


j/(n)  =  x{Mn)  =  xi{Mn) 


N/M-l 


^  k  =  0,1, N/M  -  1 


Substituting  (3)  into  (4)  gives 


NjM-l 


E  it  =  0, 1, ,  N/M  -  1 


Since  xi(n)  is  zero  at  all  n  except  when  n  is  a  multiple  of  M,  (3)  is  equivalent  to 


)=  k  =  0,l,...,NIM -1 


Now,  we  can  represent  Xi(n)  in  terms  of  x(n),  we  define  a  comb  sequence 

-  M-l 


Observe  that  c^f  (n)  the  Inverse  DFT  of  a  sequence  of  M  I’s  and  hence  is  periodic  with  a  period  of  M . 
Thus, 

f  1  n=multiple  of  M 


CM{n)  — 


0  otherwise. 
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X  / 


Figure  1:  An  M-fold  downsampler. 
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Using  (7)  and  (8)  we  can  now  write 


xi(n)  =  CM{n)x{n) 


,  A/-1 


Substituting  (10)  into  (5)  gives 


AT-i  A/-1 


Y(,,2rki{NiM)^  =  a7  S  YL  ifc  =  0, 1, . . . ,  AT/M  -  1 


n=0  /=0 


which  can  be  simplified  to 


-  Af-i 

y(eJ2'*/(^/A/))  =  -L 


^  =  0, 1, . . .,  A//A/  -  1.  The  derivation  given  above  is  very  similar  to  that  presented  in  [7]  for  general 
sequences.  Note  that  y(A;)  is  N/M  periodic.  In  other  words,  the  IDFT  of  the  first  N/M  samples 
of  y(I;)  give  the  downsampled  version  of  the  input  sequence.  From  (12)  we  see  that  the  DFT  of  the 
downsampled  sequence  y(A;)  is  a  sum  of  M  coefficients  of  the  DFT  of  the  input  to  the  downsampler, 
X{k),  spaced  N/M  coefficients  apart.  For  example,  when  M  =  2,  the  steps  involved  are  the  following: 

•  Take  the  DFT  of  j:(n) 

•  Add  the  DFT  of  x{n)  and  its  N/2  rotated  version.  This  makes  use  of  the  N  periodicity  of  X(k). 

•  Divide  the  resulting  sequence  by  2. 

•  Take  the  IDFT  of  the  first  N/2  samples. 

This  creates  the  N/2  point  dowsampled  sequence,  y(n). 

2.2  Upsampling 

The  block  diagram  of  an  upsampler  is  shown  in  Figure  2.  An  Infold  upsampler,  where  L  is  an  integer, 
takes  an  input  sequence  z(7j)  of  N  samples  and  produces  an  output 


y(”)  = 


x{n/L)  n=multiple  of  L 
0  otherwise. 


The  output  sequence  j/(n)  has  NL  samples.  The  DFT  of  y{n)  is  given  by 


Y(^^j2Tk/{NL)~^  _  ^ 
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A:  ==  0, 1, . . . ,  NL  —  1.  Noting  that  only  each  =  0, 1, . . —  1  sample  of  the  upsampled  sequence 

are  non-zero,  we  can  write  (14)  as 

^  ^  (15) 

n=:0 

A:  =  0, 1, . .  .,NL  —  1.  This  can  be  simplified  to 

y  (e^2’r*/(^^))  =  Jfc  =  0, 1, . . . ,  7VL  -  1  (16) 

In  words,  the  NL  length  DFT  of  the  upsampled  sequence  is  nothing  but  a  concatenation 

of  L  DFTs  of  x{n).  The  derivation  of  (16)  is  again  along  lines  very  similar  to  the  derivation  given  in  [7] 
for  an  infinite  sequence. 

2.3  Procedure  for  Implementation 

PRCC  filter  banks  are  filter  banks  designed  and  implemented  completely  in  the  discrete  frequency  domain 
and  satisfy  the  conditions  for  perfect  reconstruction  over  a  discrete  set  of  frequencies.  The  operations  of 
downsampling,  upsampling  and  filtering  are  carried  out  entirely  in  the  discrete  frequency  domain.  The 
basic  procedure  for  their  implementation  can  be  understood  by  referring  to  the  Figure  3.  The  analysis 
side  of  the  system  is  shown  in  Figure  3(a).  Here,  we  first  take  the  DFT  of  the  N  length  input  signal 
a:(n).  Next,  we  multiply  this  DFT,  X{k)^  pointwise  with  a  sequence  Ho{k),  which  is  the  DFT  of  an  N 
length  sequence  /io(n).  This  amounts  to  circularly  convolving  sequences,  ho{n)  and  x{n).  The  resultant 
sequence  is  then  downsampled  by  two  as  explained  in  section  2.1.  The  procedure  is  repeated  for  the 
lower  branch  with  Hi{k),  the  DFT  of  hi{n).  In  this  manner,  we  decompose  the  input  sequence  into  two 
sequences  of  length  N/2  whose  DFTs  we  denote  by  Vo(^)  and  Vi{k)  respectively. 

The  synthesis  side  of  the  system  is  shown  in  Figure  3(b).  Here,  we  upsample  the  sequences  Vo(k)  and 
Vi{k)  as  explained  in  Section  2.2.  This  gives  us  two  sequences  Uo{k)  and  Ui{k)  of  length  N.  These  are 
then  multiplied  pointwise  with  Fo{k)  and  Fi{k)  which  are  the  DFTs  of  N  length  sequences  fo{n)  and 
fi(n)  respectively.  They  are  the  synthesis  filters  corresponding  to  the  analysis  filters  /io(n)  and  hi(n) 
respectively.  The  output  of  the  synthesis  filter  bank  is  thus  given  by  Xnik)  =  Fo{k)Uoik)-^  Fi(k)Ui(k). 
The  reconstructed  signal  is  then  the  inverse  DFT  (IDFT)  of  XR{k),  namely,  XR{n). 

Note  that,  the  PRCC  filter  bank  is  a  framework  based  in  the  discrete  frequency  and  is  different  from 
the  work  proposed  by  several  authors  before  on  fast  implementation  of  FIR  filter  banks  based  on  the 
FFT  [6].  We  now  present  the  conditions  for  perfect  reconstruction  for  the  PRCC  banks  in  the  next 
subsection. 

2.4  Conditions  for  Perfect  Reconstruction 

Since  the  input  sequence  is  of  length  N  it  follows  from  the  previous  subsection,  that  the  analysis  and 
the  synthesis  bank  filters  also  have  a  support  of  N  samples.  Furthermore,  we  assume  that  N  is  even  and 
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(N/2)  I - 1 

upsample  by  2 


(N)Y 

F,ik) 


(b) 


Figure  3:  PRCC  filter  bank  system,  (a)  analysis  filter  bank  and  (b)  synthesis  filter  bank 


all  the  sequences  and  filters  are  real  valued.  The  conditions  for  perfect  reconstruction  in  this  case  are 
obtained  in  a  manner  similar  to  that  described  in  [7].  They  are  cyclic  counterparts  of  the  corresponding 
linear  relationships.  The  transfer  function  of  the  analysis-synthesis  system  is  given  by 


(17) 

for  i  =  0, 1, . . . ,  —  1.  The  second  term  in  (17)  is  the  alias  term.  To  cancel  this  term  we  choose 

Fi(eJ2xi/A?)  =  (19) 


In  addition,  substituting  (18  and  (19)  into  (17)  gives 

X[e^^-klN)  ^  1  _  JJ  '  ^ ) 

Thus,  for  perfect  reconstruction  we  require 


(20) 


^o(eJ2^*/^)/fi(_eJ2^*/^)  _  Fi(e-’2^*/^)Fo(-e-’2^*/^)  =  (21 ) 


Ar=:0,l,...,A^  —  1.  Here  c  is  a  real  constant  and  L  corresponds  to  a  circular  shift. 

Following  [7]  we  require  that  satisfy  the  power  complementarity  condition  given  by 

I  Fo(eJ^'*''^)  p  +  I  |^=  2  (22) 

1;  =  0, 1, . . .,  —  1.  In  other  words  we  require  that  |  P  be  a  half  band  filter  or  an  equiripple 

filter  that  is,  antisymmetric  about  uj  =  ;r/2  or  equivalently  n  =  N/4.  For  perfect  reconstruction  we 
choose 

^l(e;2T«:/A^)  ^  _^2^k/Njj^(^_^-:2^k/N^ 

Using  (23)  in  (18)  and  (19)  we  get 


(23) 

(24) 

(25) 


Equations  (22),  (23),  (24)  and  (25)  ensure  that  the  filters  satisfy  the  equivalent  of  paraunitary 
conditions  in  this  domain  [1,  7]  and  hence  satisfy  the  cyclic  orthogonality  relationships  given  by 


N-l 

Y,hiin)hj{{n  +  2e)modN)  =  6{i,j)6i2imodN,0)  (26) 

n  =  0 

where  £  £  Z,  the  set  of  integers  and  i,j  =  0, 1.  Note  that  these  are  the  cyclic  equivalents  of  similar 
relationships  satisfied  by  orthogonal  or  paraunitary  filter  banks  [7]. 
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2.5  Procedure  for  Constructing  PRCC  Filter  Banks 

To  obtain  the  filters  and  we  first  need  to  design  the  prototype 

filter  For  this  we  require  the  half  band  filter  defined  as 

k  =  0,1, . .  .,N  —  1.  Note  that  is  a  zero  phase  filter.  Thus,  the  design  of  )  or 

if(ik)  is  equivalent  to  assigning  a  value  to  each  DFT  coefficient  as  follows.  Assuming  0  <  H(k)  <  1,  for 
some  H(k) 


1.  H{N  -k)  =  H{k) 

2.  H{N/2  -  jfe)  =  1  -  Hik) 

3.  H{N/2  +  k)^  H(N/2-k) 

Ho{k)  can  now  be  designed  by  taking  into  account  the  fact  that 

ff(e^2xt/JV^  _|  |2 

(28) 

Therefore, 

1  /fo(eJ^'*/^)  1= 

(29) 

Given  that  in  general,  L7o(e'’2’^*/^)  has  the  form 

i7o(e^2x<:/Ar)  =1  Fo(e'’2’^*/^)  1 

(30) 

we  can  now  add  the  phase  term  Since  we  require  that  /lo(^)  real,  <^(k)  is  antisymmetric  about 
N/2.  The  filters  Hi{k),  Fo{k)  and  Fi{k)  can  now  be  derived  using  the  relations  (23),  (24)  and  (25). 
Example.  N  =  8. 

Let  H{0)  =  0.75.Then  H{4)  =  0.25 
Let  H{1)  =  0.37.  Then  H{7)  =  0.37 
and  J7(3)  =  H{b)  =  0.63 
Finally,  H{2)  =  1  -  H{2)  =  H{6)  =  0.5. 

Thus, 

H{k)  =  {0.75,0.37,0.5,0.63,0.25,0.63,0.5,0.37),  and  hence 
hin)  =  {0.5, 0.0166, 0.0, 0.1084, 0.0, 0.1084, 0.0, 0.0166} 

Note  that  the  non-zero  even  indexed  points  of  h{n)  have  value  0. 

From  (29),  this  gives  us  |  Ho{k)  |=  {0.866,0.6082,0.7071,0.7937,0.5,0.7937,0.7071,0.6082} 

Let  us  choose 

(j){k)  =  {0, 1.1168, 0.2302,  -2.6746, 0.0, 2.6746,  -0.2302,-1.1168} 

This  gives  ho{n)  =  {0.2355,0.2746,0.2215,-0.1162,0.4565,-0.13721,-0.2305,0.1618} 

This  completes  the  design  of  of  Ho{k).The  filters  Hi{k),  Fo{k)  and  Fi{k)  can  now  be  determined.  Note 
the  flexibility  and  ease  of  design  that  this  method  offers  for  designing  filters  to  specification. 
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2-6  Frequency  Sampled  Implementation  of  Bandlimited  DWT 

The  power  spectrum  of  the  Meyer  scaling  function  |  p,  satisfies  the  conditions  of  an  interpolating 
filter  in  that  it  is  antisymmetric  about  a;  =  tt.  It  follows  that  if  it  is  properly  sampled  in  the  frequency 
domain  then  these  properties  can  be  retained  and  a  half  band  filter  can  be  obtained.  To  determine  the 
rate  at  which  it  needs  to  be  sampled,  it  is  important  to  note  that  the  function  needs  to  be  sampled 
symmetrically  about  the  angular  frequency  of  tt  units.  This  means,  if  the  required  filter  size  is  N 
samples,  where  N  is  assumed  even,  then  the  samples  should  be  Aw  =  4ir/N  units  apart  in  frequency  so 
that  the  samples  are  antisymmetric  about  A^/4.  The  square  root  of  these  samples  gives  the  samples  of 
the  low  pass  filter  Ho{k),  which  can  now  be  used  in  the  PRCC  framework.  The  shape  of  the  low  pass 
and  high  pass  filter  thus  obtained  are  shown  in  Figure  4. 

3  Continuous-dilation  discrete-time  self-similar  signals  and  lin¬ 
ear  scale-invariant  systems 

This  research  addresses  the  problem  of  defining  and  representing  discrete-time  self-similar  signals  and 
systems.  The  study  of  the  discrete-time  self-similar  processes  was  motivated  in  part  by  the  previous  work 
of  Wornell  and  colleagues  [10,11,9]  in  rontinuous  time.  They  provide  formulations  involving  continuous¬ 
time,  scale-invariant  signals  and  systems.  They  also  provide  a  detailed  study  of  such  systems  for  dyadic 
scale  factors.  Our  research  provides  answers  to  questions  such  as:  Is  it  possible  to  define  purely  discrete¬ 
time,  self-similar  signals?  Are  there  formulations  of  discrete-time,  scale-invariant  systems?  How  do 
we  provide  a  definition  of  dilation  or  scaling  of  discrete-time  signal  that  is  general  enough  to  provide 
non-trivial  self-similar  signals  and  scale-invariant  systems?  The  answer  to  the  third  question  holds  the 
key  to  answering  the  first  two  questions.  A  key  result  of  the  research  is  the  demonstration  of  the  fact 
that  it  is  possible  to  define  scaling  or  dilation  in  such  a  way  that  is  continuous  even  though  the  signal 
itself  is  discrete-time.  Hereafter,  we  will  use  the  term  scaling  exclusively  to  mean  dilation.  Using  this 
definition  of  scaling,  we  develop  definitions  and  constructions  of  deterministic  and  stochastic,  discrete¬ 
time,  self-similar  signals  and  discrete-time  scale-invariant  systems. 

3.1  Scaling  in  Discrete-Time 

3.1.1  Discrete- Time  Scaling  Operation 

Generally  the  scaling  or  dilation  operation  of  a  discrete- time  signal  x(n)  by  an  arbitrary  factor  is  not 
well  defined.  It  is  difficult  to  obtain  an  interpretation  of  scaling  in  the  discrete-time  domain  that  is 
as  unambiguous  as  that  in  the  continuous-time  domain.  Operations  such  as  upsampling,  interpolation, 
downsampling  and  fractional  sampling  rate  alteration  [7]  can  have  a  scaling  interpretation.  However, 
such  operations  cannot  handle  scaling  factors  over  a  continuum.  We  present  here  a  different  approach 
to  discrete-time  scaling  that  can  handle  continuous  scaling  factors.  We  define  the  discrete-time  scaling 
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operation  in  a  way  that  effectively  amounts  to  converting  x(n)  into  a  continuous-time  signal  through 
an  inversible  mapping,  applying  the  scaling  operation  to  the  continuous-time  signal  and  finally  inverse 
mapping  the  signal  back  to  the  discrete- time  domain.  The  actual  definition  is  based  on  operations  in 
the  frequency  domain. 

Let  f{t)  be  a  continuous-time  signal  and  F(f2)  its  Fourier  transform: 

/+ 00 

(31) 

•oo 

where  — oo  <Q  <  +oo.  If  /(<)  is  scaled  by  a  (a  >  0),  its  Fourier  transform  becomes 

•^{/(</“)}  =  aF{aQ),  -oo  <Q<  +oo.  (32) 

Thus,  for  a  continuous-time  signal,  a  scaling  in  time  can  be  accomplished  in  principle  by  a  frequency¬ 
scaling  of  its  Fourier  transform  in  the  opposite  direction  along  with  an  amplitude  scaling.  Now,  consider 
a  discrete-time  sequence  x{n)  whose  Fourier  transform  is 

;f(w)  =  ^{i(n)}  =  ^ar(n)e"'’'"".  (33) 

n 

The  function  X{uj)  is  27r-periodic.  If  we  try  to  define  a  discrete-time  scaling  operation  by  adapting 
(32)  to  (33),  it  will  only  work  for  integer  values  of  a  because  of  the  27r-periodicity  requirement  on  the 
Fourier  transform  of  a  discrete-time  signal.  This  corresponds  to  upsampling  the  discrete-time  signal 
by  an  integer  factor  of  a.  The  implementation  of  our  discrete-time  continuous  scaling  operation  is  as 
follows  (see  Figure  5). 

1.  Given  is  a  discrete-time  signal  x{n)  with  (the  27r-periodic)  Fourier  transform  X(u;). 

2.  Map  the  principal  interval  u)  G  [— 7r,7r]  to  continuous  frequency  Q  (the  real  line)  through  an 
invertible  transformation  Q  = 

3.  Dilate  y(n)  =  X{f^^{Q))  by  the  required  dilation  factor  a  to  form  ya(f^)  =  aY{aQ), 

4.  FormXa(w)  =  Ya{f[uj]) 

5.  The  sequence  Xa(n)  resulting  from  the  inverse  Fourier  transformation  of  Xai^j)  is  the  continuous 
dilation  of  x(n)  by  a 

Xa{n)  =  aG-^{X[r\af{u^))]}.  (34) 

where  denotes  inverse  discrete-time  Fourier  transform. 

Some  examples  of  /(lj)  (w  G  [— tt,  tt])  are: 

•  Bilinear  transform.  Q  =  /{lj)  =  2tan(u;/2). 

•  l/uj-based  transform.  Q  =  f{u))  = 

•  log-based  transform.  Q  =  f{uj)  =  s^n(w)ln 


15-14 


3.1.2  System  Properties 


Let  Sa  denote  the  discrete-time  scaling  operator  defined  above.  It  is  straightforward  to  verify  that  Sa 
has  the  following  properties: 

1.  5a  is  a  linear  operator. 

2.  Sa  (a  ^  1)  is  a  time-varying  operator. 

3.  5i{x(n)}  =  x(n)  as  expected.  This  corresponds  to  the  non-scaling  case. 

4.  The  inverse  operator  5”^{x(n)}  =  5i/a{x(n)}  is  discrete-time  scaling  operation  with  parameter 
1/a. 

5.  Commutativity 

5o{56{x(n)}}  Sb{Sa{x{n)}}  =  5a6{a:(n)}  (35) 

6.  If  the  discrete- time  Fourier  spectrum  in  the  principal  interval  [“7r,7r]  of  an  input  discrete- time 
signal  is  a  function  of  i.e., 

X{u>)^T[f(^)l  (36) 

and  the  function  T(a;')  satisfies 

r(au;')  =  Cia)T{u;'),  (37) 

where  C(a)  is  a  function  of  a,  then  the  output  of  the  discrete-time  scaling  operator  is 

5a{x{n)}  =  aC(a)x(n).  (38) 

Property  6  provides  some  interesting  insights  into  the  discrete-time  scaling  operation.  It  implies  that  if 
the  inverse  Fourier  transform  of  the  function  T[f{u})]  exists,  the  corresponding  time  sequence  represents 
an  eigen-function  of  the  system.  Also,  when  the  input  spectrum  satisfies  (36)  and  (37),  for  example. 

T(a;')  =  u;'"  and  hence  A(a;)  =  T[/(u;)]  =  [/(l^)^  ,  (39) 

the  output  spectrum  is  identical  to  the  input  Within  an  amplitude  factor  aC{a)  in  the  example). 

In  other  words,  the  signal  is  identical  to  a  scaled  version  of  itself  within  an  amplitude  factor. 

3.2  Discrete-time  Self-Similar  Signals 

3.2.1  Self- Similarity 

Two  types  of  self-similar  signals  will  be  discussed  in  here:  deterministic  and  stochastic. 

Definition:  A  discrete-time  sequence  x(n)  is  deterministically  self-similar  or  homogeneous  with  degree 
H  if  it  satisfies  the  following  relation. 

5a{ic(n)}  =  a""^x(n)  (40) 
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for  any  a  >  0.  A  random  process  X{n)  is  said  to  be  statistically  self-similar  with  degree  H  if  it  satisfies 
the  following  equation 

5a,«{iex(n,n')}  =  a-2"i?x(n,n')  (41) 

for  any  a  >  0,  where  Rx{n,  n')  denotes  the  auto-correlation  function  of  sequence  A'(n),  and  5a,6{ar{m,  n)} 
for  a  2-D  function  x(m,n)  is  defined  in  lines  similar  to  that  of  Sa>  However,  the  scaling  operation  is 
applied  on  both  m  and  n  dimensions. 

3.2.2  Discrete-Time  Homogeneous  Signal 

As  mentioned  in  section  3.1.2,  the  time  sequence  corresponding  to  inverse  Fourier  transform  of  function 
[/(u;)]"*,  if  exists,  satisfies  (40)  with  H  =  -(r -f  1).  Thus,  by  choosing  a  function  [/(w)]^  which  is 
absolutely  integrable  in  — tt  to  tt,  we  can  derive  a  class  of  discrete-time  homogeneous  functions.  This 
class  of  homogeneous  functions  could  provide  a  model  for  discrete-time  self-similar  process  in  practice. 
They  also  serve  as  eigen-functions  of  the  discrete-time  scaling  operator  previously  defined. 

As  we  know,  the  class  of  continuous- time,  regular,  homogeneous  functions  such  as  f{i)  =  1  is  limited. 
Truly  continuous  homogeneous  signals  corresponding  to  the  spectrum  do  not  exist  because  it  is  not 
a  valid  Fourier  spectrum.  In  our  formulation  of  discrete-time  self-similar  functions,  we  are  able  to  derive 
purely  discrete  time  sequences  os  lung  as  [/(w)]''  defines  a  valid  discrete-time  Fourier  spectrum.  Non¬ 
trivial  discrete-time  homogeneous  functions  actually  exist  and  can  be  derived  in  the  following  ranges  of 
r  parameter  with  respect  to  different  mappings. 

•  Bilinear  Transform.  —  1  <  r  <  1. 

•  1/u;  Based  Transform.  —1  <  r  <  1. 

•  log-Based  Transform,  r  /  — 1,  — 2,  — 3, ... 

Figure  6  shows  some  examples  of  discrete-time  deterministic  self-similar  functions  which  are  derived 
from  discrete-time  Fourier  spectrum  [/(w)]^^^.  /(u;)  is  chosen  as  bilinear  transform,  l/w-based  and 
log-hased  transform  respectively. 

3.3  Discrete-Time  Linear  Scale-Invariant  Systems  And  Self-Similar  Func¬ 
tions 

3.3.1  Discrete- Time  Linear  Scale-Invariant  System 

A  linear  scale-invariant  (LSI)  system  is  a  linear  system  whose  output  is  invariant  to  the  scale  changes 
of  the  input.  Let  a:(n),  y(n),  (n  G  (-00,00))  be  the  input  and  output  sequence.  Take  an  arbitrary 
discrete-time  sequence  h{k)  {k  =  1,2,  and  let 

K 

y{n)  =  Y,h{k)Sk{x{n)]/k.  (42) 

kzzl 
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Figure  6:  Eigen-functions  of  the  discrete-time  s 
transform  (b)  l/a;-based  transform  (c)  log-hased 
functions  are  shown. 
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This  defines  a  discrete-time  LSI  system.  The  output  of  the  system  is  the  summation  of  a  series  of  dilated 
(by  k)  input  sequence,  weighted  by  the  kernel  h{k).  Figure  7  shows  the  procedure  for  the  implementation 
of  the  system.  Note  that  the  choice  of  the  one  dimensional  kernel  h{k)  is  arbitrary.  Also,  as  mentioned 
in  section  3.2,  the  discrete-time  sequences  corresponding  to  spectrum  [/(w)]^  are  eigen-functions  of  the 
discrete-time  scaling  operator.  Inherently,  they  also  serve  as  the  eigen-functions  of  the  discrete-time  LSI 
system. 


3.3.2  Discrete-time  Statistically  Self-Similar  Signal 

As  mentioned  in  [5,  10],  most  physical  processes  that  exhibit  statistical  self-similarity  are  fundamentally 
non-stationary.  The  statistical  properties  of  the  signal  change  with  time,  but  remain  invariant  with  time 
scale.  In  this  section  we  provide  a  model  for  such  non-stationary  self-similar  random  processes  using 
the  discrete-time  LSI  system.  Our  implementation  in  discrete-time  domain  is  based  on  the  following 
property  of  the  discrete- time  LSI  system. 

Theorem:  If  the  input  sequence  of  a  discrete-time  LSI  system  is  discrete- time  zero-mean  white  noise, 
the  output  sequence  of  the  system  is  non-stationary  and  statistically  self-similar  which  satisfies  condition 
(41)  with  H  =  -1. 

proof:  See  [12]. 

Hence  we  can  construct  a  non-stationary  self-similar  random  signal  with  parameter  if  =  —1  by  pass¬ 
ing  a  discrete-time  zero-mean  white  noise  through  the  discrete-time  LSI  system.  By  passing  the  signal 
thus  obtained  through  the  system  again,  a  non-stationary  self-similar  random  signal  with  parameter 
if  =  -2  is  then  acquired.  Following  this  scenario,  we  are  able  to  formulate  a  non-stationary  self-similar 
random  process  with  parameter  if  being  an  arbitrary  negative  integer.  Note  that  the  choice  of  the  one 
dimensional  kernel  h{k)  in  our  discrete-time  LSI  system  is  essentially  arbitrary.  We  can  choose  a  specific 
kernel  h{k)  so  that  the  output  of  the  system  exhibits  the  properties  of  the  studied  physical  self-similar 
processes.  As  there  is  no  restriction  on  the  length  of  the  kernel,  a  rich  class  of  existing  FIR  or  HR  filters 
can  be  applied  to  model  the  behavior  of  a  large  variety  of  self-similar  random  processes  in  practice . 

Figure  8  demonstrates  the  effect  of  passing  a  discrete-time  zero- mean  white  noise  through  a  discrete¬ 
time  LSI  system.  The  output  is  a  discrete-time  stochastic  self-similar  signal.  As  is  known,  if  the  system 
output  is  wide-sense  stationary,  the  2-D  plot  of  auto-correlation  function  of  the  output  signal  will  consist 
of  a  series  of  diagonal  straight  contour.  The  auto-correlation  plot  in  Figure  8  clearly  demonstrate  the 
non-stationary  property  of  the  output  signal. 

4  Scaling  Mixtures  in  Multichannel  Data 

The  detection  and  classification  of  objects  in  images  is  obviously  not  restricted  to  visual  imagery.  It 
can  arise  in  other  contexts  as  well.  In  particular,  our  investigation  examined  the  role  of  scaling  in 
multichannel  recordings  for  bearing  and  velocity  estimation  of  wideband  plane  waves  which  would  have 
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Figure  8:  Simulation  of  passing  a  zero-mean  white  noise  through  discrete-time  LSI  system.  The  6-tap 
Hanning  window  is  used  as  1-d  kernel  for  the  discrete- time  LSI  system,  (a)  system  input  (b)  system 
output  (c)  auto-correlation  function  of  the  output  (d)  contour  plot  of  the  auto-correlation  function  of 
the  output 
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potential  application  to  radar  and  sonar. 


4.1  Scaling  Mixtures  in  Bearing  Estimation 

Suppose  we  have  the  two-dimensional  plane  wave 

s{n,t)  =  Af{t  —  k.u/c)  (43) 

impinging  on  a  uniform  linear  array  of  N  receiving  elements  located  at  (xi,  0),  (x2, 0),  •  •  • ,  (xyv,  0)  respec¬ 
tively,  where  /(x)  is  a  square-integrable  function,  u  =  [ux,  Uy]  is  the  spatial  position  vector,  k  =  iy] 
is  the  direction  vector,  c  is  the  speed  of  propagation  of  the  plane  wave  in  the  medium  and  Xn  =  (n  —  l)d 
with  d  being  the  sensor  spacing.  In  the  narrowband  formulation,  /(x)  is  either  a  complex  exponential  or 
something  close  to  it.  However,  the  assumption  of  /(x)  possessing  finite  energy  rules  out  such  a  model 
here.  The  recording  in  the  sensors  is  denoted  x„(t)  for  n  =  1, . . . ,  7/.  We  have 

-  1)^ +«(0  (44) 

where  n(t)  is  the  additive  noise  which  could  be  due  to  any  or  all  of  clutter,  sensor  noise  and  jamming. 
Examining  (44)  we  find  that  a  snapshot  in  time  of  the  sensor  recordings  consists  of  samples  of  the 
function  /(x)  taken  at  x  =  t  —  ^(n  —  1)  for  n  =  1, . . . ,  A^.  An  integration  over  time  of  the  correlation 
matrix  of  snapshots  yields  (ignoring  the  noise  for  the  time  being)  the  N  x  N  Toeplitz  matrix 


RkAi) 

RkAN-1) 

R  = 

RkM 

RkAN-2) 

(45) 

RkAN-2)  ■■■ 

RkAO) 

where 

RkAr) 

1 

II 

(46) 

with 

R/{r)  = 

/  /{<)/(<  +  ’■) 

(47) 

being  the  temporal  autocorrelation  of  the  function  /(<).  Thus,  we  find  that  the  angle  of  arrival  as 
determined  by  reveals  itself  as  a  temporal  scaling  k^d/c  in  the  autocorrelation  matrix.  The  angle 
of  arrival  determination  problem  is  thus  transformed  into  one  of  determining  the  dilation  or  scaling 
from  the  spatial  autocorrelation  matrix  in  the  presence  of  noise.  With  multiple  plane  waves,  the  spatial 
autocorrelation  matrix  consists  of  the  sum  of  autocorrelations  at  different  scalings  and  hence  constitutes 
a  scaling  mixture. 

4.2  Scaling  Mixtures  in  Velocity  Estimation 

Suppose  the  pulse  f{t)  above  is  reflected  ofT  a  moving  object  whose  velocity  component  along  the 
direction  of  travel  of  the  pulse  is  v.  Then,  upon  reflection,  the  pulse  f(t)  suffers  a  time  scaling  of 
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(c-f  v)lc,  that  is,  if  f{i)  is  symmetric,  the  received  pulse  is  a  time-delay  of  /  ^  temporal 

autocorrelation  of  the  received  pulse  would  result  in  a  time-scaled  autocorrelation  of  the  transmit  pulse 
and  would  be  independent  of  the  time-delay.  Clearly,  the  time-delay  is  an  estimate  of  the  range  while 
the  scaling  provides  an  estimate  of  the  velocity  v.  The  temporal  autocorrelation  when  there  are  multiple 
objects  at  different  velocities  again  results  in  a  scaling  mixture  of  the  autocorrelation  of  the  pulse. 

4.3  Scale  Estimation 

In  the  particular  case  that  the  function  f{t)  is  a  wavelet  then  so  is  its  autocorrelation.  A  wavelet 
transform  of  the  autocorrelation  scaling  mixture  using  this  wavelet  would  lead  to  scale  determination 
since  the  scale  coordinates  of  peak  positions  in  the  wavelet  transform  correspond  to  the  scale  parameters 
of  the  individual  components  in  the  input  [4].  However,  it  is  likely  that  high  resolution  estimation  of 
the  scaling  parameters  would  require  procedures  analogous  to  the  subspace  methods  developed  for  the 
narrowband  case.  We  have  begun  investigating  the  development  of  such  methods.  This  is  a  promising 
area  for  further  research. 
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Abstract 


This  grant  developed  tools  for  use  in  performance  evaluation  and  benchmarking  of  the  IPL  HTML  interface 
to  multimedia  databases.  The  major  contributions  of  the  grant,  described  in  this  report,  were  in  the  areas  of 
developing  benchmarking  support  for  IPL  HTML,  developing  benchmarking  tools  for  use  in  ongoing  per¬ 
formance  evaluations  of  the  evolving  IPL  HTML  software,  and  experimentation  with  software  that  could  be 
used  in  the  IPL  HTML  environment  to  enhance  performance.  Each  of  these  contributions  is  described  fur¬ 
ther  in  the  report.  The  report  concludes  with  a  summary  of  the  contributions  to  the  IPL  HTML  system  that 
were  developed  under  this  grant. 
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IPL  HTML  Interface  Performance  Evaluation 


Scott  Spetka 


Introduction 

This  grant  developed  tools  for  use  in  performance  evaluation  and  benchmarking  of  the  IPL  HTML  interface 
to  multimedia  databases.  The  major  contributions  of  the  grant,  described  in  this  report,  were  in  the  areas  of 
developing  benchmarking  support  for  IPL  HTML,  developing  benchmarking  tools  for  use  in  ongoing  per¬ 
formance  evaluations  of  the  evolving  IPL  HTML  software,  and  experimentation  with  software  that  could  be 
used  in  the  IPL  HTML  environment  to  enhance  performance.  Each  of  these  contributions  is  described  fur¬ 
ther  in  the  report.  The  report  concludes  with  a  summary  of  the  contributions  to  the  IPL  HTML  system  that 
were  developed  under  this  grant. 

Benchmarking  Support 

Benchmarking  tools  were  integrated  into  the  implementation  of  IPL  HTML  to  enhance  the  SQL  query 
interface  and  the  Harvest  query  interface  with  real-time  feedback.  This  code  allowed  experimentation  with 
queries  of  varying  complexity.  The  output  displays  detailed  system  performance  information.  It  includes 
numbers  of  page  faults,  cpu  usage,  numbers  of  I/O  operations  and  elapsed  time.  The  most  useful  metric 
proved  to  be  the  elapsed  time  since  internal  performance  metrics  were  not  available  for  Harvest  and  Sybase. 
Experiments  using  these  benchmarking  tools  proved  that  the  systems  were  comparable  in  speed  with  Har¬ 
vest  outperforming  Sybase  for  some  types  of  queries. 

The  experiment  proved  that  Harvest  could  be  an  important  tool  for  an  information  analyst.  It  is  particularly 
effective  in  cases  where  queries  are  very  selective  (produce  small  results  compared  to  the  data  space 
searched).  Harvest  does  not  allow  "Less  than"  or  "greater  than"  operators  in  queries  and  does  not  support 
complex  aggregate  operations,  like  averages,  supported  by  general  purpose  database  systems  like  Sybase. 
The  study  confirmed  the  relevance  of  Harvest  to  the  continuing  development  of  the  IPL  HTML  architec¬ 
ture.  The  benchmarking  tools  were  implemented  in  the  actual  IPL  HTML  code  where  they  can  be  used  for 
continual  evaluation  of  system  performance. 
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Benchmarking  Tools 


IPL  HTML  is  implemented  as  an  extension  to  a  WWW  server.  Measuring  the  performance  of  CGI-BIN 
extensions  to  the  HTTPD  requires  tools  to  generate  a  load  on  the  server.  The  study  considered  the  use  of 
WebStone,  a  tool  that  automatically  generates  a  load  on  a  server  by  submitting  a  mix  of  queries.  Experi¬ 
ments  with  this  tool  demonstrated  its  usefulness  for  IPL  HTML.  Experiments  were  conducted  at  SUNY 
Institute  of  Technology  using  university  equipment  made  available  for  this  research. 

The  WebStone  software  was  installed  on  the  SUNY  Institute  of  Technology  DogNET  Unix  network.  Sup¬ 
porting  software  for  WebStone  was  made  and  installed.  This  included  Perl  5.0  and  Gnu  Make.  This 
included  Gcc  V  2.7.2.  WebStone  was  then  installed.  Make  was  used  to  develop  other  software  as  well. 

System  were  reconfigured  and  WWW  pages  were  added  to  the  SUNY  Tech  WWW  server  to  keep  track  of 
Webserver  traffic.  An  account  was  added  for  WebStone  on  Growl,  a  Sun  Solaris  system.  The  following  sta¬ 
tistical  software  was  installed  to  monitor  server  load:  Webstat  2.0,  GWstat  2.3,  3Dstat  2.1,  and  Analog  2.0. 
Determining  when  a  system  must  be  enhanced  to  support  increased  loads  is  an  important  part  of  perfor¬ 
mance  evaluation. 

Setting  up  and  testing  WebStone  turned  out  to  be  difficult.  WebStone  did  not  run  on  FreeBSD,  the  operat¬ 
ing  system  in  use  on  most  of  the  DogNET  machines.  It  compiled  but  did  not  work  right.  The  documenta¬ 
tion  is  very  poor  and  there  is  not  much  of  it.  Reddog  is  running  Solaris  2.5.1  but  it  is  the  only  system  run¬ 
ning  Solaris.  WebStone  needs  at  last  two  computers  to  run,  a  master  and  a  client.  Spot  and  Growl  both  run 
Sun  OS  4.1.3.  Gcc  needed  to  be  installed  on  the  two  SunOS  machines  and  some  other  system  problems 
needed  to  be  fixed  before  some  test  results  should  be  obtained  from  WebStone. 

Performance  Enhancing  Tools 

Performance  Enhancing  Tools  can  be  used  to  improve  performance  of  the  IPL  HTML  server.  The  system  is 
implemented  as  an  extension  of  a  standard  WWW  server  so  that  any  tools  that  enhance  WWW  server  per¬ 
formance  would  be  of  interest  for  some  functions  of  the  IPL  HTML  server.  Experimentation  with  software 
that  could  be  used  to  enhance  WWW  server  performance  provides  direction  as  to  whether  use  of  this  type 
of  tool  should  be  considered  for  the  IPL  HTML  environment.  Squid  is  a  system  of  caching  software  for 
web  clients  and  front  end  software  for  web  servers.  Experiments  were  performed  at  SUNY  Institute  of 
Technology  on  the  DogNET  to  evaluate  whether  this  software  should  be  considered  for  IPL  HTML. 
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Squid  was  installed  and  configured  on  Pitbull.  Pitbull  was  chosen  because  it  had  sufficient  disk  space  and 
was  not  running  any  other  services  on  DogNET.  Squid  Squid  can  be  configured  in  two  ways.  In  a  web 
cache  proxy  configuration,  squid  is  used  as  a  cache  for  incoming  web  traffic  from  the  Internet.  It  is  best 
suited  for  a  situation  where  a  number  of  analysts  are  accessing  the  same  remote  server.  The  first  time  a  page 
is  requested  from  an  outside  server  it  is  stored  on  the  cache  proxy.  Any  other  requests  to  these  pages  will 
be  obtained  from  the  cache  proxy  and  not  from  the  remote  server.  In  this  configuration  outside  network  traf¬ 
fic  is  reduced  and  the  pages  that  are  requested  by  multiple  users  can  be  delivered  more  quickly. 

The  second  way  that  Squid  can  be  configured  is  as  an  accelerator.  In  this  configuration  squid  acts  as  a  mir¬ 
ror  to  an  existing  server.  One  way  to  use  this  configuration  is  to  put  the  Webserver  on  the  inside  of  a  fire¬ 
wall  and  to  have  the  accelerator  on  the  outside  of  the  firewall.  In  this  configuration,  Squid  maintains  a 
cache  of  pages  from  the  associated  WWW  server  and  delivers  them  without  placing  a  load  on  the  produc¬ 
tion  WWW  server.  When  a  page  is  required  that  is  not  in  the  cache,  of  course  Squid  must  retrieve  it  from 
the  WWW  server.  As  with  any  cacheing  scheme,  locality  of  reference  in  access  patterns  is  required  for  suc¬ 
cessful  operation.  This  may  well  be  the  case  for  IPL  HTML  where  recently  archived  multimedia  objects  are 
expected  to  be  of  greater  interest  that  are  older  objects. 

Another  reason  to  use  Squid  as  a  accelerator  is  to  create  a  mirror.  For  example,  the  primary  server  could  be 
in  Spain  but  serving  data  to  a  significant  number  of  locations  in  the  United  States.  Squid  could  run  on  a 
computer  in  the  US  and  request  people  from  the  US  to  use  it.  This  would  cut  down  on  the  bandwidth 
requirements  for  higher  cost  international  network  links.  The  software  and  documentation  are  both  good 
for  this  shareware  software. 

Results 

An  experiment  was  conducted  using  WebStone  as  a  tool  to  measure  the  performance  of  the  SUNY  Tech 
WWW  server  with  and  without  Squid  acceleration.  The  study  compared  the  speed  of  our  Webserver  and 
the  speed  of  Squid  using  it  as  an  accelerator  to  our  Webserver.  Results  presented  here  show  that  going 
directly  to  the  Webserver  is  faster  than  going  through  the  Squid  accelerator. 

Below  is  a  summery  of  tests  from  WebStone.  The  experiment  was  performed  twice  on  each  configuration  to 
measure  server  performance  with  and  without  Squid  acceleration.  The  Squid  server  was  running  on  Pitbull 
and  the  WWW  server  was  running  on  Arf  as  indicated  in  the  results. 
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First  run  testing  the  squid  server  running  on  Pitbull: 


WEBSTONE  2.0  results: 


Total  number  of  clients: 

2 

Test  time: 

5  minutes 

Server  connection  rate: 

8.77  connections/sec 

Server  error  rate: 

0.0000  err/sec 

Server  throughput: 

1.09  Mbit/sec 

Little’s  Load  Factor: 

1.99 

Average  response  time: 

0.2276  sec 

Error  Level: 

0.0000  % 

Average  client  throughput: 

562  Kbit/sec 

Sum  of  client  response  times: 

598.794322  sec 

Total  number  of  pages  read: 

2631 

Second  run  testing  the  squid  server  on  Pitbull 


WEBSTONE  2.0  results: 


Total  number  of  clients: 


Test  time: 


Server  connection  rate: 


Server  error  rate: 


Server  throughput: 


Little’s  Load  Factor: 


Average  response  time: 


Error  Level: 


Average  client  throughput: 


Sum  of  client  response  times: 


Total  number  of  pages  read: 


5  minutes 


8.87  connections/sec 


0.0000  err/sec 


1.03  Mbit/sec 


1.99 


0.2248  sec 


0.0000  % 


531  Kbit/sec 


598.99352  sec 


2660 


Conclusion 


The  benchmarking  tools  developed  under  this  grant  have  already  proven  useful  in  performance  evaluation 
of  the  IPL  HTML  interface.  They  are  part  of  the  Evolving  IPL  HTML  Interface.  Experiments  with  perfor¬ 
mance  enhancing  tools  were  less  conclusive.  For  the  DogNET  network,  using  Webserver  directly  was  faster 
then  going  to  the  Squid  server.  This  could  have  happened  for  a  number  of  reasons.  It  could  be  our  network 
topology  or  differences  in  the  configurations  of  systems  that  Squid  and  our  WWW  server  are  running  on. 
Experiments  showed  that  Squid  is  a  stable  piece  of  software  and  that  further  development  and  experimenta¬ 
tion  with  Squid  may  be  useful.  In  addition,  experience  with  WebStone  will  prove  useful  for  further  work  in 
IPL  HTML  performance  evaluation.  In  any  case.  Squid  would  be  of  value  if  it  can  be  run  effectively  as  a 
cache  proxy. 

Providing  an  effective  cacheing  mechanism  for  IPL  HTML  goes  beyond  the  basic  functionality  of  Squid. 
Most  of  the  IPL  HTML  functionality  is  implemented  in  CGI-BIN  functions  which  generate  pages  dynami¬ 
cally.  Data  that  is  presented  to  users  is  derived  from  multimedia  databases  rather  than  stored  HTML  docu¬ 
ments.  The  advantages  of  cacheing  could  easily  be  worth  an  effort  to  extend  Squid  to  provide  adequate 
HTML  and  multimedia  object  cacheing  to  allow  dynamic  pages  to  be  created.  It  is  imperative  that  such 
functionality  be  developed  as  increases  in  demand  for  Internet  based  services  continue  to  outpace  the 
increases  in  network  bandwidth  available.  A  cost-effective  solution  in  the  short  term  will  be  to  depend  on 
software  support  to  improve  performance  and  avoid  expensive  hardware  upgrades  until  prices  come  down. 
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ABSTRACT 


The  confinement  of  optical  modes  of  vibration  in  a  superlattice  consisting  of  polar 
and  nonpolar  materials  is  described  by  a  continuum  model.  Specifically,  the  structure 
under  investigation  is  the  Si/ZnS  superlattice.  Optical  phonon  modes  in  Si  and  ZnS 
layers  are  totally  confined  within  their  respective  layers  since  both  layers  can  be  treated 
as  infinitely  rigid  with  respect  to  the  other  layer.  Since  there  are  no  associated  electric 
fields  with  nonpolar  optical  phonons  in  Si  layers,  only  a  mechanical  boundary  condition 
needs  to  be  satisfied  for  these  nonpolar  optical  modes  at  the  Si-ZnS  interface.  The 
optical  phonons  in  Si  layers  can  be  described  by  guided  modes  consisting  of  an  uncoupled 
s-TO  mode  and  a  hybrid  of  LO  and  p-TO  modes  with  no  interface  modes.  In  ZnS 
layers,  a  continuum  model  hybridizing  the  LO,  TO  and  IP  modes  is  necessary  to  satisfy 
both  the  mechanical  and  electrostatic  boundary  conditions  at  the  heterointerface.  A 
numerical  procedure  is  provided  to  determine  the  common  frequency  between  LO,  TO, 
and  IP  modes.  This  is  a  new  procedure  for  obtaining  the  eigen  modes  of  a  mixed  polar- 
nonpolar  heterosystem.  Analytical  expressions  are  obtained  for  the  ionic  displacement 
and  associated  electric  field  as  well  as  scalar  and  vector  potentials.  The  established 
model  for  the  confined  optical  phonons  is  used  in  calculating  the  intersubband  heavy-hole 
scattering  rate  by  optical  phonons  in  the  Si/ZnS  superlattice.  Our  results  indicate  that 
contributions  to  the  intersubband  scattering  rate  from  Si  or  ZnS  confined  optical  phonons 
depend  strongly  on  the  distribution  of  envelope  wavefunctions  over  the  respective  layers 
within  which  different  types  of  optical  phonons  are  confined. 
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Intersubband  Heavy-Hole  Scattering  by  Confined  Optical  Phonons  in 

Si/ZnS  Super  lattices: 

a  Design  Step  Towards  Si/ZnS  Near- Infrared  Intersubband  Lasers 

Gang  Sun 

I.  INTRODUCTION 

The  demonstration  of  the  InGaAs/AlInAs  intersubband  quantum  cascade  laser 
at  A  =  4.2/xm[l,  2]  has  spurred  interest  in  the  use  of  silicon  as  the  lasing  material 
because  of  its  integrability  with  advanced  silicon  microelectronics[3,  4].  There  is  also 
interest  in  moving  the  lasing  from  the  far-  and  mid-infrared  range  to  the  near-infrared 
optical  communications  wavelengths,  A  =  1.3  or  1.55//m[5].  Since  the  latter  wavelength 
corresponds  to  a  photon  energy  of  SOOmeV,  the  Sii-^Gei/Si  heterosystem  is  inadequate 
because  a  maximum  practical  valence  band  offset  of  only  ~  500meU  can  be  obtained 
for  pseudomorphic  Sii_a;Gei  layers  at  a;  =  0.5  ~  0.6.  Therefore,  alternative  large- 
bandgap,  nearly  lattice-matched  barrier  materials  for  Si  quantum  wells  must  be  sought; 
materials  with  sufficiently  large  band  offsets  with  respect  to  silicon.  Possible  candidates 
include,  ZnS,  BeSeTe,  CaF2,  SiOi,  Si02,  the  Si/Si02  superlattice,  and  7-AI2O3,  among 
others  [5]- [7]. 

The  Si/ZnS  heterosystem  has  received  the  most  attention  as  current  advances 
in  epitaxy  technology  have  allowed  the  growth  of  heterostructures  consisting  of  polar 
and  nonpolar  materials[8,  9].  The  lattice  mismatch  of  cubic  ZnS  with  respect  to  Si 
is  only  0.3%.  The  valence  band  offset  has  been  predicted  theoretically[10]-[13],  while 
recent  experiments[14]  show  that  the  value  is  close  to  1.5eV,  sufficiently  large  to  give 
intersubband  energy  differences  in  the  desired  SOOmeV  range.  MBE  growth  of  ZnS 
upon  Si,  and  Si  upon  ZnS  have  been  demonstrated [9],  with  the  use  of  an  As  monolayer 
to  satisfy  the  local  bonding  requirements,  although  the  effect  of  the  monolayer  on  the 
offsets  has  not  been  determined. 

The  possibility  of  population  inversion  and  the  operation  of  intersubband  lasers 
depend  critically  on  the  lifetimes  of  the  involved  subbands.  The  subband  lifetimes  in 
turn  are  determined  by  nonradiative  phonon  scattering  processes.  The  purpose  of  the 
present  paper  is  to  study  the  optical  phonon  modes  and  their  interaction  with  carriers 
in  the  Si/ZnS  system  since  the  optical  phonon  scattering  is  considered  to  be  dominant 
in  the  phonon  scattering  processes.  This  combination  of  materials  is  new,  since  it  con¬ 
sists  of  both  a  nonpolar  and  polar  semiconductor.  Previous  studies  in  carrier  scattering 
by  confined  optical  phonons  in  heterostructures  have  been  focused  only  on  one  type  of 
phonons,  either  polar[15]-[23]  or  nonpolar[24]-[27].  In  the  current  situation  involving 
both  polar  and  nonpolar  materials,  carrier  scattering  by  both  types  of  phonons  needs 
to  be  considered.  To  the  best  of  our  knowledge,  there  has  not  been  any  reported  in¬ 
vestigation  on  this  mixed  nature  of  optical  phonons,  their  confinement  effect,  and  their 
interaction  with  carriers  in  a  heterostructure.  In  this  paper,  we  will  present  a  theoret¬ 
ical  study  based  on  the  macroscopic  continuum  model  to  describe  the  confined  optical 


17-3 


phonon  modes  and  will  use  this  model  to  calculate  the  optical-phonon  scattering  of 
heavy  holes  in  a  heterostructure  consisting  of  polar  (ZnS)  and  nonpolar  materials  (Si), 
as  we  are  interested  in  the  feasibility  of  constructing  an  intersubband  laser  within  the 
valence  band  of  the  Si/ZnS  heterostructure.  This  valence  intersubband  laser  will  likely 
be  a  quantum-parallel  superlattice  laser[28]  or  a  quantum  cascade  laser.  Our  latest 
thinking[29]  is  that  each  of  the  N  laser  periods  will  consists  of  one  square  Si  quantum 
well  containing  two  active  heavy-hole  subbands.  Due  to  the  non-parabolicity  of  these 
subbands,  we  believe  that  a  population  inversion,  localized  in  fc-space,  can  be  engineered 
between  the  subbands.  In  this  investigation,  we  will  consider  a  simple  superlattice  com¬ 
prised  of  alternating  layers  of  Si  and  ZnS,  much  like  the  flat-band  superlattice  of  the 
quantum  parallel  laser  (Fig. 2(b)  of  [28]).  The  results  of  this  study  will  provide  the  basis 
for  the  calculation  of  subband  lifetimes  required  to  determine  laser  gain  and  threshold. 

As  described  below  in  greater  detail,  since  the  optical  dispersions  (frequency 
versus  wavevector)  of  the  silicon  {husi  =  SimeV)  and  zinc  sulphide  {huzns  =  4SmeV) 
have  no  overlap,  the  optical  phonons  are  assumed  to  be  totally  confined  in  both  materials. 
In  the  silicon  layers,  a  continuum  model  with  double  hybridization  of  the  longitudinal 
optical  (LO)  and  transverse  optical  (TO)  modes  is  used  to  describe  the  vibration  patterns 
of  the  guided  modes[24].  The  only  boundary  condition  that  needs  to  be  satisfied  in  the 
Si  layers  is  the  vanishing  of  the  displacements  at  the  Si-ZnS  interface,  since  the  ZnS 
layers  can  be  considered  as  infinitely  rigid  with  respect  to  the  vibrations  of  the  Si  layer. 
Hence,  there  is  no  interface  mode  in  the  Si  layers.  The  situation  on  the  ZnS  layers  is  more 
complex.  Following  the  work  by  Ridley[16,  17],  here  a  continuum  model  is  employed 
with  hybridization  of  the  optical  LO,  TO,  and  interface  polariton  (IP)  modes  needed 
to  satisfy  both  the  mechanical  and  electrostatic  boundary  conditions  at  the  interfaces. 
Specifically,  the  electrostatic  boundary  conditions  are  the  continuity  of  E^,  the  electric 
field  parallel  to  the  interface,  and  the  continuity  of  the  displacement  field  normal  to 
the  interface.  The  mechanical  boundary  condition  is  again  the  vanishing  of  the  optical 
displacements  since  the  Si  layers  can  be  considered  as  infinitely  rigid  with  respect  to  the 
vibrations  of  the  ZnS  layers. 

Our  current  work  provides  a  complete  set  of  analytical  expressions  for  the  optical 
phonon  dispersion  relations,  optical  displacements,  and  associated  scalar  and  vector 
potentials.  These  expressions  are  subsequently  used  in  calculating  the  interaction  of 
heavy  holes  with  the  confined  optical  phonons.  In  Section  II,  we  establish  a  continuum 
model  for  the  optical  displacement  modes  in  Si  and  ZnS  layers  satisfying  both  mechanical 
and  electrical  boundary  conditions.  In  section  III,  we  outline  a  numerical  procedure  for 
determining  the  frequency  of  a  ZnS  optical  mode  inducing  the  intersubband  scattering. 
In  Section  IV,  we  describe  the  scalar  and  vector  potentials  associated  with  the  ZnS 
ionic  displacement  modes.  In  Section  V,  we  calculate  the  intersubband  scattering  rate 
due  to  the  emission  of  Si  and  ZnS  optical  phonons  since  the  emission  process  is  rather 
significant  compared  to  the  scattering  process  of  optical  phonon  absorption.  In  Section 
VI,  we  summarize  and  discuss  our  results  and  conclusions. 


II.  Mode  Patterns 


A  continuum  model  for  the  optical  modes  in  the  Si/ZnS  superlattice  is  employed. 
Both  mechanical  and  electrical  boundary  conditions  are  satisfied  at  the  heterointerfaces. 
Since  the  optical  dispersion  relations  (frequency  versus  phonon  wavevector)  in  the  two 
bulk  materials  have  no  overlap,  the  phonons  are  taken  to  be  confined  in  their  respective 
materials.  For  the  Si  layers,  the  continuum  model  for  optical  phonons  in  nonpolar 
materials[24,  26]  is  used.  Here  double  hybridization  of  the  LO  (longitudinal  optical) 
and  TO  (transverse  optical)  modes  is  used  to  give  the  vibration  patterns  of  the  guided 
modes.  Since  the  ZnS  layers  are  infinitely  rigid  with  respect  to  the  vibrations  of  the  Si 
layers,  only  the  mechanical  boundary  condition,  the  vanishing  of  the  displacements  at 
the  interfaces,  has  to  be  satisfied. 

For  the  polar  ZnS  layers,  the  situation  is  more  complex  and  an  alternate  contin¬ 
uum  model[16,  17]  consisting  of  an  intermixing  of  confined  LO,  TO,  and  IP  (interface 
polariton)  modes  is  needed  in  order  to  satisfy  both  the  electrostatic  and  mechanical 
boundary  conditions.  The  boundary  conditions  which  must  be  satisfied  are  (1)  the  con¬ 
tinuity  of  Ex,  the  component  of  electric  field  parallel  to  the  interface,  (2)  the  continuity 
of  D^,  the  component  of  the  displacement  vector  normal  to  the  interface,  and  (3)  the 
vanishing  of  the  vector  displacement  u  at  the  interface. 

A.  Modes  in  Si  Layers 

As  discussed  above,  since  the  ZnS  layers  can  be  treated  as  infinitely  rigid,  the 
boundary  condition  to  be  satisfied  in  the  Si  layers  is  the  vanishing  of  the  ionic  dis¬ 
placement  of  all  confined  vibration  modes.  This  is  an  assumption  of  strict  confinement 
yielding  only  the  guided  modes.  As  pointed  out  in  the  continuum  theory [24],  the  ionic 
displacement  of  confined  vibrations  has  two  components:  one  is  the  hybrid  of  the  LO 
and  p-polarized  TO  (p-TO)  modes,  and  other  is  the  uncoupled  s-polarized  TO  (s-TO) 
mode.  These  modes  are  defined  as  follows:  If  we  consider  a  {x,  z)  plane  containing  the 
normal  to  the  layers  and  the  phonon  wavevector  Q,  then 

Q  —  "b  (1) 

where  and  e'^  are  unit  vectors.  The  p-TO  mode  has  its  displacements  normal  to  Q 
and  in  the  plane,  while  the  s-TO  displacements  are  normal  to  Q  and  perpendicular  to 
the  plane  {\\ey).  The  form  of  the  ionic  displacement,  scalar,  and  vector  potentials  in 
one  superlattice  period  differs  from  that  in  a  neighboring  period  only  by  a  phase  factor 
proportional  to  the  Bloch  superlattice  wavevector  qsl-  Their  expressions  given  below 
are  obtained  by  taking  gsL  =  0.  A  description  of  the  s-TO  mode  is 

Uy  =  (2) 

while  the  hybrid  of  the  LO  and  p-TO  modes  is  given  by 

Ux  =  +  qAAp-roe^'^^^  +  Bp_Toe-*'^^")], 


(3) 


which  are  confined  within  the  Si  layer  with  a  width  of  dsu  0  <  z  <  dsi-  The  2- 
components  of  the  LO  and  TO  wavevector  have  been  distinguished  by  ql  and  qt,  re¬ 
spectively. 

Since  the  LO  and  TO  modes  must  have  the  same  frequency  to  be  effectively 
coupled,  we  must  satisfy  the  condition 

aliql  +  ql)  =  ul  -  a^iql  +  q^),  (4) 

where  ujq  is  the  bulk  Si  optical  phonon  frequency  at  F  point,  ai  and  ar  are  the  sound 
velocities  of  LO  and  TO  dispersions  in  Si,  respectively. 

Using  the  boundary  condition  that  u  =  0  at  the  interfaces  gives  for  the  s-TO 

mode 

TITT 

Uy  =  sm{q^z),  with  q^  =  —  (5) 

dsi 

where  n  =  1,2, •••  and  .4  is  a  mode  coefficient.  This  mode  does  not  mix  with  other 
modes. 


The  hybrid  LO  and  p-TO  modes  admit  two  classes  of  solutions.  The  ‘sine’  solu¬ 
tion  is 

Ux  =  2Be^^^^qx[cos{qLz)  -  cos{qTz)], 

2 

Uj  =  2iBe^'^^^[qL  sin(9£,2:)  -f  —  sin(g7’-2)], 

Qt 


and  the  ‘cosine’  solution  is 


Ux  =  2iBe''>^-[qx  siniqLz)  -h  ^  sin(9T^)], 
Ux  =  2Be^^^^qL[cos{qiz)  -  cos(gr2)] 


where 


Ql 


ni-K 

— —  and  qx 

dsi 


nj-TT 


(8) 


where  Ul  —  1,  2,  ■  •  •,  nr  =  3, 4,  •  •  •,  rir  -  til  =  2, 4, 6,  •  •  •,  and  5  is  a  mode  coefficient. 
No  interface  modes  exist  in  the  Si  layer  because  of  the  boundary  condition  u  =  0. 


The  lowest  s-TO  mode  pattern  in  Eq.(5)  for  q^  =  n/dsi  is  shown  in  Fig. 1(a) 
within  a  Si  layer  of  dsi  =  40A,  while  the  hybrid  patterns  of  the  lowest  p-TO  and  LO 
modes  with  qi  =  n/dsi  and  qx  =  Sir/dsi  are  shown  in  Figs.l(b)  and  1(c)  for  the  ‘sine’ 
and  ‘cosine’  solutions  given  in  Eqs.(6)  and  (7),  respectively  within  the  same  Si  layer.  The 
strict  confinement  which  requires  the  vanishing  of  ionic  displacements  at  the  boundaries 
of  Si  layers  is  clearly  demonstrated  for  both  vibration  modes. 


B.  Modes  in  ZnS  Layers 


The  boundary  conditions  are  the  continuity  of  Ex,  D^,  and  the  vanishing  of  u  at 
the  interfaces.  These  conditions  can  be  satisfied  by  a  unique  linear  combination  of  LO, 
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TO,  and  IP  modes  with  common  frequency  and  common  in-plane  wavevector  Qx, 

u  =  ViLO  +  ^TO  +  U/p.  (9) 

We  will  use  this  hybrid  expression  to  calculate  the  electrical  interaction  with  carriers 
which  is  considerably  stronger  than  the  optical  deformation  potential  interaction.  We 
need  consider  only  the  displacements  Ux  and  Uz,  since  Uy  associated  with  the  s-TO  mode 
has  no  related  electric  field  and  therefore  does  not  interact  with  carriers  electrically. 
Once  again,  the  expressions  are  obtained  by  taking  the  Bloch  superlattice  wavevector 
qsL  =  0. 

For  the  LO  mode,  the  ionic  displacements  are 

which  is  confined  within  the  ZnS  layer  with  a  width  of  dznS,  -dznsl‘^  <  z  <  dzus/"^ 
The  associated  electric  fields  are 


Ex  PqUxj  E^  — 


where 


with  the  effective  ionic  charge 


(11) 

(12) 


=  Mnuloeli—  -  -),  (13) 

^OO 

where  M  is  the  reduced  mass,  Co  is  the  permittivity  of  free  space,  too,  Cs  are  the  high- 
frequency  and  static  permittivities,  and  ft  is  the  volume  of  primitive  unit  cell.  The  scalar 
potential  (j)  associated  with  the  electric  field  E  =  -Vcj)  is  in  turn  given  as 

(t>  =  (14) 


For  the  TO  mode 

The  electric  fields  associated  with  this  mode  are  negligible. 

For  the  IP  mode 

Ux  =  -f  Spe-'-?^*), 

u,  =  -  Bpe"*"'’") 

The  associated  electric  fields  are 

Ex  Pp'^xi  E^  —  PpU^,  (1”^) 
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where 


Pp  —  Po 


'TO 


,  i2 

‘^LO 


,  >2 
U!to 


(18) 


and  uio  and  uto  are  bulk  ZnS  LO  and  TO  optical  phonon  frequencies  at  F  point, 
respectively.  The  electric  fields  associated  with  the  interface  modes  propagate  into  the 
Si  layers  although  they  are  treated  as  infinitely  rigid  and  do  not  contain  ZnS  ionic 
displacement. 


Being  a  transverse  electromagnetic  wave,  there  is  a  vector  potential  A  associated 
with  the  electric  field  E  =  -dA/dt.  Within  the  ZnS  layers, 

A,  = 

While  in  the  Si  layers,  a  similar  expression  can  be  obtained  with  another  set  of  mode 
coefficients,  Api  and  Bpi. 

Since  large  in-plane  wavevectors  are  likely  to  be  involved  when  dealing  with  carrier 
transitions  due  to  optical  phonons  between  two  subbands  separated  with  a  relatively 
large  energy,  we  have  endeavored  to  obtain  analytical  dispersions  of  the  LO  and  TO 
optical  branches  by  curve-fitting  the  experimental  bulk  phonon  dispersions  for  the  entire 
Brillioun  zone.  The  requirement  for  common  frequency  yields. 


^  =  ^LO  -  Piiql  +  ql) 
=  i-^TO  -  Priql  +  qr) 

^  c{ql  +  qiy^ 

n{uj) 


(20) 


where  (5i  —  O.SOSTi/z  •  and  /3p  =  2.194THz  •  are  obtained  from  curve-fitting 
the  bulk  ZnS  optical  phonon  dispersions,  c  is  the  velocity  of  light  in  vacuum,  and 
is  the  index  of  refraction.  In  the  above  expressions,  the  frequency  in  the  ZnS  layers 
lies  between  the  ZnS  LO  and  TO  zone  center  frequencies.  Since  upo  <  ujlo,  in  order 
for  the  TO  frequency  to  be  equal  to  a  LO  frequency  qp  must  be  imaginary  qp  =  iqg, 
corresponding  to  a  TO  interface  mode.  The  modes  which  interact  most  strongly  with 
carriers  are  those  with  frequencies  near  the  LO  branch.  For  these  modes,  the  value  of 
qo  is  large,  and  w(’  can  take  the  approximation 


tanh(god2„s)  ^  1.  (21) 

In  the  unretarded  limit  (c  ^  oo),  +  ^2  _  q  jp 

Applying,  at  the  two  interfaces  between  layers  Si  and  ZnS  in  a  period  of  the 
superlattice,  the  conditions  that  and  equal  to  zero  along  with  the  continuity  of 
and  D^,  leads  to  eight  simultaneous  equations  involving  the  eight  unknown  mode  coef¬ 
ficients  {Al,  Bi,  At,  Bj-;  Ap,  Bp]  and  .4pi,Bpi).  The  following  two  ionic  displacement 
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mode  patterns  emerge  for  the  hybrid  in  Eq.(9)  taking  the  Bloch  superlattice  wavevector 
qsL  =  0  and  the  approximation  ta.nh{qodzns)  ~  1-  Both  ionic  displacement  patterns  are 
confined  within  the  ZnS  layer,  —dzns/^  <  z  <  dzns/"^-  For  the  first  type, 

Ux  =  2iBe^'^’^^qx{sm{qLz) 

-[1  -  Pi  tB.nhiqxdzns/'^)]  sin{qLdzns/2)  . 

sinh{qodzns/2) 

.  I  ,  sinh(9,,z) 

-Pi  sin  9Ldz„s/2  — — — - 

cosh(ga,dzns/2) 

Uz  =  2Be'''^^^qL{cos{qiz)  ^  ^ 

— ^[1  —  Pi  tanh(5a;d^„5/2)]  sin{qLdzns/^)  ■t-z: — ^ 
qigo  smh{qodzns/2) 

Qx  ■  (  ,  /o\  cosh(ga.z) 

- Pi  s\n{qidzns  2) — — ^}, 

Ql  cos\i{qxdznsl^) 

and  for  the  second  type, 

Ux  =  2Be^’‘^'^qx{cos{qiz) 

-[1  -  p2  cot]\{qxdznsl‘2)] 

smh(9od2„s/2) 

-P2  cos{qLdzns/2)  . 

sinh{qxdzns/2) 

Uz  =  2iBe'‘>^'^qL{sm{qLz)  '■  ’ 

+ j£_[l  _  p2  coth(g^d2n5/2)]  cos{qLdzns/2)  . 

QlQo  smh{qodzns/2) 

+  — P2  COs{qLdzns/2)  . 

Ql  smh.{qxdzns/2) 

where 

^  cosh(g:zdst/2)  cosh(gz:dgns/2) 
d 

_  smh{qxdsi/2)  smh{qxdzns/2) 

~  d  ’  (24) 

d  =r  smh{qxdsi/2)  cosh{qxdzns/2)  +  smh{qxdzns/2)  cosh{qxdsi/2), 
y  —  f£i 

fp2 

and  Cpi  and  tp2  are  the  permittivities  in  Si  and  ZnS  layers,  respectively,  with 


ep2  —  fc 


U)"  —  UJl 


To  illustrate  the  patterns  of  ionic  displacements  in  the  ZnS  layers  given  in  Eqs.(22) 
and  (23),  we  need  to  first  determine  values  for  qx,  qi,  and  qo-  To  do  so,  we  will  follow 
the  numerical  procedure  described  in  Section  III  by  arbitrarily  fixing  a  value  for  the  in¬ 
plane  phonon  wavevector  qx  =  -K/idazus),  where  azns  is  the  lattice  constant  of  ZnS.  In 


calculating  the  carrier-optical  phonon  interaction,  the  value  of  Qx  is  actually  determined 
by  the  conservation  of  in-plane  momentum  between  the  initial  and  final  states  of  the 
scattering  process.  For  a  given  value  of  Qx,  typically,  a  set  of  hybridized  modes  can  be 
obtained.  Here,  we  show  only  the  mode  pattern  with  frequency  close  to  uw- 

We  obtained  hiv  =  AlmeV,  qi  =  0.46  x  10®/cm  and  qo  —  0.48  x  10®/cm.  Substi¬ 
tuting  these  values  into  Eqs.(22)  and  (23),  we  obtained  Figs. 2(a)  and  2(b)  showing  the 
mode  patterns  of  ionic  displacement  of  both  the  first  and  second  types,  respectively  in 
a  ZnS  layer  of  dzns  =  20A.  It  can  be  seen  from  Figs. 2(a)  and  2(b)  that  the  mechanical 
boundary  condition,  vanishing  of  the  ionic  displacements  at  the  interfaces  of  Si  and  ZnS 
layers,  is  satisfied. 


III.  Dispersion  Relationship 


The  phonon  frequency  in  the  ZnS  layers  is  determined  by  the  following  set  of 
equations; 

^  =  ^LO-  PM  +  ql)^ 

U)  =  ujto  -  Priql  -  ql), 


where 


with 


and 


ti  -f  t2  cos{qLdzns)  +  h  sinlqidzns)  =  0 


ti  =  4psinh(^j,d5i)  +  4prsmh{qxdzns), 

t2  =  -4pa, 

t3  =  %p^r  smh{qxdzns)  smh{qxdsi)  -  4p^a^ 

-)-4pV2  sinh.'^ (qxdzns)  +  Ap^  smh^{qxdsi)  +  1, 


P  = 


qx 


Aqirsd 


s  = 


UJ< 


TO 


^LO  ^TO 


(26) 


(27) 


(28) 


(29) 


The  third  equation  in  (26)  is  obtained  from  the  requirement  of  a  nonzero  solution  for 
the  eight  simultaneous  equations  discussed  above,  and  Eq.(27)  is  arrived  at  under  the 
approximation,  tanh(godzns)  ~  1- 


The  numerical  procedure  for  determining  a  phonon  frequency  is  the  following: 
given  a  value  of  qx,  we  can  determine  those  of  ti,  t2,  and  from  Eq.(27).  Then  u  is 
scanned  from  loto  to  loiq-  For  a  given  value  of  w,  qi  and  qo  are  obtained  from  the  first 
two  equations  in  (26).  Those  values  are  then  substituted  into  the  third  equation  in  (26) 
to  determine  if  the  particular  value  of  a;  is  a  solution. 


IV.  Scalar  and  Vector  Potentials 
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Associated  with  the  two  types  of  ionic  displacement  in  Eqs.(22)  and  (23),  the 
scalar  potentials  in  ZnS  layers  are  given  as,  for  the  first  type. 


2poBe^'^^^  s\n{qiZi)  l^il  <  ^  ZnS  layer, 

0  1^2!  <  ^  Si  layer. 


and  for  the  second  type. 


— 2ipo5e*^^®  cos(9i2i)  \zi\  <  ZnS  layer, 

■#■  =  J  (31) 

0  ^2!  <  -^  Si  layer. 

z 

Note  that  we  have  used  two  different  coordinates  zi  and  Z2  for  layers  ZnS  and  Si,  re¬ 
spectively,  with  their  origins  placed  at  the  centers  of  the  respective  layers. 

The  vector  potentials  can  be  obtained,  for  the  first  type, 

2spoqx  o  ig^x  •  /  j  /n\  sinh(ga,zi)  dznS  „  c-  1 

- Be''"  Pismiqidzns/^) — w— 1 - j?x  ki  <  -7r~  ^nS  layer, 

A^  =  )  u;  cosh{q^dzns/2)  2  (32) 

sinh(5,Z2)  k2|  <  %  Si  layer, 

w  uj  Z 

sin(tedz„5/2)  ZnS  layer. 

A,  =  \  cosh(9xdzn5/2)  2  (33) 

cosh(g,Z2)  1-^21  <  ^  Si  layer, 

1.  uj  2 

and  for  the  second  type, 


A.  = 


Ax  = 


2%SPoqx  _ /  J  ,n\  cosh(9x2i)  I  I  ^  dznS  ry  o  , 

- ; — P2  cos{qLdzns/2)  - -  2(1  <  ^—  ZnS  layer, 

u  smh{qxdzns/2)  2 

_3lElBe'‘^^^V2Cosh.{qxZ2)  \z2\  <  ^  Si  layer, 

U)  z 


A.  = 


2spoqx  D„iaxx _ j  /o^  sinh(9xZi)  dznS  ry  o  1 

- Be  P2Cos{qLdznS  2)  - —  zi  <  ZnS  layer, 

a;  sinh(gxdzns/2)  2 

^Be'''""\/2 sinh(9x^2)  ^2!  <  %  Si  layer, 

(jj  z 


where 


Vi  = 


rs\n{qidznsl2)  cosh(gxd2ns/2) 


Vo  = 


'Mlrdzus!'^ 


sinh(gxdzns/2) 


and  Pi,  P2,  and  d  are  given  in  Eq.(24). 
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The  scalar  potentials  associated  with  the  LO  modes  are  strictly  confined  within 
the  ZnS  layers.  Their  distributions  are  shown  in  Fig.3  for  the  first  and  second  types 
given  in  Eqs.(30)  and  (31)  with  ql  =  0.31  x  10®/cm,  dzns  =  20A,  respectively. 

The  vector  potential  associated  with  the  IP  modes  are  distributed  in  both  Si  and 
ZnS  layers,  even  though  Si  layers  are  treated  as  infinitely  rigid  and  do  not  contain  ZnS 
ionic  displacements.  The  profiles  for  the  two  components  of  the  vector  potentials  given 
in  Eqs.  (32-35)  for  the  first  and  second  types  with  dsi  =  40A,  dznS  =  20A  are  shown  in 
Figs. 4(a)  and  4(b),  respectively. 

It  can  be  seen  from  Figs. 3  and  4  that  both  scalar  and  vector  potentials  are 
not  continuous  across  the  interfaces.  However,  as  pointed  by  Ridley[16],  the  energy  of 
interaction  with  an  electron  traveling  coherently  with  the  optical  phonon  is  continuous. 
The  electric  field  can  be  obtained  as 

E=-V0-f. 

The  continuity  of  Ex  and  =  e(a;)E'2  implies  that  at  the  boundaries, 

‘^^x\z2=±dsi/2  =  ~Qx(l>\zi-^dzns/2  + ‘^"4i|2i=Tdzns/2i 
■^z\z2=±dsi/2  =  ^^2|2i==Fdz„s/2) 

where  Aix  and  Ai^  are  x-  and  z-components  of  the  vector  potential  in  Si  layers.  The 
interaction  in  the  Si  layer  is  e{AixVx  +  Ai^Vz)  and  in  the  ZnS  layer  e{—(j)  +  AxVx  +  AzVz), 
which  are  equal  when  the  electron  velocity  Vx  =  lj/qx  and  =  0.  Thus,  the  coherent 
interaction  energy  is  continuous  across  the  interfaces. 

The  electric  field  distributions  for  Ex  and  €{uj)  in  Si  {dsi  =  40A)  and  ZnS  {dznS  = 
40A)  layers  are  shown  in  Figs. 5(a)  and  5(b)  for  the  first  and  second  types,  respectively. 
The  continuity  of  Ex  and  across  the  Si  and  ZnS  interface  according  to  Eq.(38)  is 
clearly  demonstrated. 


(37) 

(38) 


V.  Intersubband  Scattering 

Since  the  optical  modes  in  the  Si/ZnS  superlattice  consist  of  confined  nonpolar  Si 
and  polar  ZnS  optical  phonons,  the  calculation  of  carrier  scattering  by  optical  phonons 
in  such  a  structure  needs  to  include  contributions  from  both  types  of  phonons.  The 
Hamiltonian  that  describes  the  carrier  interaction  with  the  nonpolar  Si  optical  phonons 
is  given  by[30] 

77  =  •  u  (39) 

where  D  is  the  optical  deformation  potential.  This  Hamiltonian  obviously  vanishes 
outside  of  the  Si  layers  in  the  Si/ZnS  superlattice  since  the  Si  optical  displacement 
modes  are  strictly  confined  within  the  Si  layers.  However,  the  carrier  interaction  with 
the  confined  polar  ZnS  optical  phonons  extends  over  both  ZnS  and  Si  layers.  The 
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electrical  interaction  Hamiltonian  can  be  obtained  using  the  scalar  and  vector  potentials 

H=-e(j)+-A-  p,  (40) 

m 

where  p  is  the  momentum  operator,  e  and  m  are  the  free  electron  charge  and  mass, 
respectively.  Although  the  scalar  potential  0  associated  with  the  LO  mode  vanishes  in 
Si  layers,  the  A  •  p  interaction  exists  in  both  layers  since  the  vector  potential  associated 
with  the  interface  modes  in  the  ZnS  layers  as  shown  in  Fig. 4  propagates  into  the  Si 
layers  as  well. 

A.  Scattering  due  to  Si  Phonons 


The  displacement  patterns  described  in  Eqs.(5-7)  all  contain  an  arbitrary  constant 
for  the  mode  amplitude  which  can  be  normalized  by  equating  the  energy  of  the  vibration 
mode  with  that  of  a  simple  harmonic  oscillator[16] 

=  n  C'  "*  ■ 

where  S  is  the  sample  surface  area  (in  {x,  y)  plane),  Q,  is  the  volume  of  the  unit  cell,  and 
X  is  the  normal  coordinator  of  the  oscillator.  The  heavy-hole  state  can  be  characterized 
by  |k,  n  >  with  the  in-plane  momentum  k  and  subband  index  n.  In  the  approximation 
of  constant  effective  mass  for  heavy  holes,  the  matrix  element  for  the  transition  from 
state  |k,  n  >  to  |k',  n'  >  due  to  the  emission  of  a  nonpolar  Si  optical  phonon  is 


<  k',n'|H'lk,n  >= 


I  fi  nj^ujo)  ~l~  1]  1^  _  ,  'T'0^ 

I  h[n{LUo)  -f  1]  ~ 

“^PSi^aSdsiAcigL^QT) 

•[F>iG'^„'(9l,  Qt)  +  Qt)]  (hybrid), 


■<^k'±qx,k 


for  the  s-TO  mode  and  the  hybrid  of  the  LO  and  p-TO  mode,  respectively.  n{uo) 
is  the  number  of  Si  optical  phonons  at  thermal  equilibrium,  and  pst  is  the  density 
of  Si.  The  three  components  of  the  optical  deformation  potential,  Dy,  and  D~ 
are  assumed  equal  to  Do  =  D/y/Z  in  the  calculation,  in  view  of  the  assumption  of 
isotropy.  The  Kronecker  symbol  indicates  the  in-plane  {x,y)  momentum  conservation. 
The  normalization  factors  are  given  by 

Aa(9z)  =  ^  [  u*Uydz  (s-TO), 

dsi  Jo  ^43) 

(u*iii  -I-  u\uz)dz  (hybrid). 

The  Gnn' -functions  contain  envelope  wavefunctions,  ipn  and  from  which  interference 
effect  can  be  obtained.  Specifically, 

Gln'iQz)  =  f  Vni’n'Uydz,  (44) 

J  0 
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for  the  s-TO  mode,  and 


fdsi 

^  (45) 

^nn'\QL)QT)  / 

Jo 

for  the  hybrid  of  LO  and  p-TO  mode.  The  heavy-hole  energy  levels  and  envelope  wave- 
functions  are  obtained  by  the  finite  square  well  model  for  the  superlattice  with  the 
heavy-hole  band  offset  taken  to  be  1.5eV^[14]. 


Applying  the  Fermi  golden  rule,  we  obtain  the  scattering  rate  due  to  the  emission 
of  a  nonpolar  Si  optical  phonon, 


+  Wl  ^  \Gnn'? 

2h^  PiUodsi 

’’^hhb^i^o)  +  Ij-Pp  l^nn'  + 

2h^p,Uods,  Ac 


(s-TO) 

(hybrid). 


(46) 


where  we  have  assumed  that  for  the  intersubband  process  (n  7^  n')  the  heavy  holes  are 
scattered  from  the  bottom  of  their  original  subbands  (k  =  0),  and  the  sum  is  over  those 
participating  modes  of  Eq.(8)  that,  according  to  Eq.(4),  yield  values  oi  satisfying  the 
in-plane  momentum  conservation  [24], 


B.  Scattering  due  to  ZnS  Phonons 


The  normalization  of  the  amplitudes  of  the  confined  ZnS  displacement  modes 
can  be  carried  out  by  equating  the  energy  of  a  hybrid,  a  mixture  of  mechanical  and 
eletromagnetic  energies,  with  that  of  a  simple  harmonic  oscillator[16].  Since  only  the  IP 
mode  contributes  electromagnetic  energy  which  is  small  in  magnitude  when  compared 
with  the  mechanical  energy,  neglecting  it  entirely  will  introduce  little  error  in  evaluating 
the  energy  of  a  hybrid  ZnS  mode.  We  therefore  can  use  Eq.(41)  for  the  normalization 
of  a  ZnS  mode,  except  that  now  the  integral  is  over  the  ZnS  layer,  dz^s-  The  matrix 
element  for  the  transition  from  state  |k,n  >  to  |k',n'  >  due  to  the  emission  of  a  polar 
ZnS  optical  phonon  is 


<  k',n'|if|k,n  >= 


N 


e'^h[n{uzns)  +  ij^ZnS 


tlk.r 


ih 


2SdznsepAi,2  m 


for  both  the  first  and  second  types.  n{uzns)  is  the  number  of  ZnS  optical  phonons  at 
thermal  equilibrium,  and 

1  1  1 

r-  (48) 


6p 


The  normalization  factors  for  both  the  first  and  second  types  can  all  be  calculated  by 

j'JZnS 

^ZnS 


1  rdZnS 

^1,2  "1  /  U^Uz)dz 

(JZnS  J^ 


(.19) 
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with  optical  displacements  given  in  Eqs.(22)  and  (23).  The  G„n' -functions  containing  the 
interference  effect  between  two  subband  envelope  wavefunction,  ipn  and  'ipn',  are  given 
specifically  as 


Po  JO 


for  the  scalar  potential  scattering  associated  with  the  LO  modes,  and 


=  H  A.rni^n'dz, 

1  r^^ZnS  d 

for  the  vector  potential  scattering  associated  with  the  IP  modes.  Applying  the  Fermi 
golden  rule,  the  intersubband  scattering  rate  due  to  the  emission  of  a  polar  ZnS  phonon 
can  then  be  obtained  by  taking  summation  over  contributing  confined  ZnS  optical  modes 


iv  -  +  1]u;lo 

2h^dznse,  t 


_  ^4>  ,  _  ^r<z  12 


where  we  have  again  assumed  that  the  heavy  holes  are  scattered  from  the  bottom  of 
subband  n  (k  =  0),  and  have  taken  the  approximation  of  uJznS  =  ^lo  since  the  modes 
which  interact  most  strongly  with  carriers  are  those  with  frequencies  near  the  LO  branch. 


C.  Intersubband  Scattering  Rates 


The  scattering  rates  due  to  the  emission  of  Si  and  ZnS  optical  phonons  were 
calculated  for  the  intersuband  transition  (2-1)  originated  from  the  bottom  of  the  heavy- 
hole  subband  2  with  zero  kinetic  energy  to  heavy-hole  subband  1.  Figure  6  shows  the 
2-1  scattering  rates  as  a  function  of  the  Si  well  width  while  fixing  the  barrier  width  at 
dzns  =  40A  in  the  Si/ZnS  superlattice.  The  total  scattering  rate  is  the  summation  of 
contributions  from  the  heavy-hole  interaction  with  Si  and  ZnS  optical  phonons.  In  the 
small  well  widtli  region  {dsi  <  30A),  the  heavy-hole  scattering  due  to  the  ZnS  optical 
phonons  is  stronger  than  that  due  to  the  Si  optical  phonons.  This  is  attributed  to  the  fact 
that  when  the  Si  well  width  is  small  there  is  significant  envelope  function  overlap  between 
subbands  1  and  2  in  the  ZnS  barrier  region  where  the  ZnS  LO  phonons  are  confined.  As 
the  well  width  increases,  the  distribution  of  envelope  functions  in  the  barrier  decreases. 
As  a  result,  the  scattering  due  to  the  ZnS  LO  phonons  reduces  considerably  and  the  ZnS 
phonon  scattering  is  mostly  through  IP  modes  which  propagate  throughout  the  super¬ 
lattice  structure.  As  the  well  width  continues  to  increase,  the  energy  separation  between 
subband  1  and  2  decreases.  The  intersubband  scattering  between  these  two  subbands 
requires  an  emitted  ZnS  phonon  with  a  small  in-plane  wavevector  {qx)  in  order  to  satisfy 
the  in-plane  momentum  conservation  for  the  scattering  process  to  take  place.  This  leads 
to  an  increased  intersubband  scattering  rate  since  polar  optical  phonons  with  smaller 
wavevectors  interact  more  strongly  with  carriers  to  induce  intersubband  transitions  as 
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suggested  by  the  well-known  1/ (q^  +  q^)  dependence  of  the  interaction  Hamiltonian  in 
polar  material  quantum  wells[15].  A  similar  dependence  of  the  intersubband  scattering 
rate  of  Eq.(52)  due  to  the  confined  ZnS  optical  phonons  is  implicitly  included  in  the  nor¬ 
malization  factor  (Ai,2)  given  by  Eq.(49).  Further  increasing  the  well  width  to  dsi  >  82A 
causes  the  energy  separation  between  the  two  subbands  to  be  less  than  the  ZnS  optical 
phonon  energy  (43meV)  and  the  heavy  holes  at  the  bottom  of  subband  2  cannot  emit 
ZnS  optical  phonons  to  make  a  transition  to  subband  1,  resulting  in  zero  scattering  rate 
due  to  the  emission  of  ZnS  optical  phonon.  The  scattering  rate  due  to  the  emission 
of  Si  optical  phonon  confined  within  the  Si  well  demonstrates  a  steady  decrease  as  the 
well  width  {dsi)  increases,  which  suggests  that  the  factor  l/dsi  in  Eq.(46)  dominates 
the  small  increase  in  the  interference  effect  G„n/  function.  As  the  well  width  increases 
beyond  62A,  the  energy  separation  between  the  two  heavy-hole  subbands  becomes  less 
than  the  Si  optical  phonon  energy  (64meV^).  As  a  result,  the  scattering  rate  due  to  the 
emission  of  Si  optical  phonons  reduces  to  zero,  in  which  case  the  heavy-hole  lifetime 
of  subband  2  can  be  enhanced  dramatically  since  the  significant  scattering  process  of 
optical  phonon  emission  is  suppressed  although  the  weaker  optical  phonon  absorption 
and  acoustic  phonon  scattering  processes  are  still  possible. 

Figure  7  shows  the  intersubband  scattering  rates  between  the  same  two  heavy- 
hole  subbands  due  to  the  emission  of  both  Si  and  ZnS  optical  phonons  as  a  function  of 
the  barrier  width  {dzns)  in  the  Si/ZnS  supperlattice.  The  well  width  {dsi)  is  fixed  at 
30A.  The  scattering  rate  due  to  the  emission  of  Si  optical  phonons  remains  unchanged 
as  the  barrier  width  varies  since  the  subband  energy  levels  are  hardly  shifted  and  the 
Gorin' -function  for  the  Si  phonon  scattering  has  little  noticeable  change.  The  scattering 
rate  due  to  the  emission  of  an  ZnS  optical  phonon,  on  the  other  hand,  demonstrates 
a  decreasing  trend  as  the  ZnS  barrier  width  increases  as  suggested  in  Eq.(52)  with  the 
factor  of  l/dzns-  The  small  discontinuous  incremental  steps  in  the  ZnS-scattering  curve 
are  due  to  the  discrete  nature  of  the  increase  in  the  number  of  allowed  LO  modes  confined 
in  the  ZnS  barrier  as  it  increases. 

VI.  Summary  and  Discussion 

We  have  provided  an  analytical  model  of  optical  modes  in  Si/ZnS  superlattices 
consisting  of  polar  and  nonpolar  optical  phonons.  This  is  a  new  procedure  for  obtaining 
the  eigen  modes  of  a  mixed  polar-nonpolar  heterosystem.  In  the  Si  layers,  a  continuum 
model  with  double  hybridization  of  the  LO  and  TO  modes  is  used  to  describe  the 
vibration  patterns.  Since  there  is  no  electric  field  resulting  from  the  nonpolar  ionic 
displacements  in  Si  layers,  the  only  boundary  condition  that  needs  to  be  satisfied  in  the 
Si  layers  is  the  vanishing  of  the  displacements  at  the  Si-ZnS  interface,  as  the  ZnS  layers 
can  be  considered  as  infinitely  rigid  with  respect  to  the  vibrations  of  the  Si  layer.  Due  to 
this  strict  confinement,  only  guided  modes  emerge  in  the  Si  layers  which  consist  of  s-TO 
and  coupled  p-TO  and  LO  modes,  with  no  interface  modes.  These  guided  modes  have 
been  illustrated.  Their  interaction  with  carriers  in  the  superlattice  can  be  calculated 
through  the  optical  deformation  potential  for  Si.  The  interaction  Hamiltonian  can  be 
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obtained  by  taking  the  product  of  this  potential  with  the  normalized  ionic  displacement. 

However,  for  the  optical  phonons  in  ZnS  layers,  we  need  to  include  the  electrical 
interaction  in  calculating  the  carrier  scattering  by  optical  phonons,  since  there  are  elec¬ 
tric  fields  associated  with  the  polar  optical  vibrations.  As  a  result,  both  mechanical  and 
electrostatic  boundary  conditions  need  to  be  satisfied  in  the  interfaces.  A  continuum 
model  employing  a  linear  combination  of  LO,  TO  and  IP  (interface  polariton)  modes 
with  a  common  frequency  is  used  to  describe  the  ionic  displacements  in  ZnS  layers.  A 
numerical  procedure  for  determining  a  phonon  frequency  is  provided.  This  hybridized 
model  is  necessary  to  meet  the  simultaneous  requirement  on  the  mechanical  and  electro¬ 
static  boundary  conditions.  The  mechanical  boundary  condition  is  again  the  vanishing 
of  the  optical  displacements  since  Si  layers  can  be  considered  as  infinitely  rigid  with 
respect  to  the  vibrations  of  the  ZnS  layers.  The  electrostatic  boundary  conditions  are 
the  continuity  of  the  electric  field  parallel  to  the  interface,  and  the  continuity  of  the 
displacement  field  normal  to  the  interface.  Based  on  this  set  of  boundary  conditions, 
expressions  are  obtained  for  the  ionic  displacements  in  ZnS  layers  consisting  of  LO,  TO, 
and  IP  modes.  There  are  scalar  and  vector  potentials  associated  with  the  LO  and  IP 
modes,  respectively,  but  no  electric  field  associated  with  the  TO  mode.  The  scalar  po¬ 
tential  and  its  associated  electric  field  due  to  the  LO  mode  are  distributed  only  within 
the  ZnS  layers  and  are  zero  in  the  Si  layers.  But  the  vector  potential  and  its  associ¬ 
ated  electric  field  due  to  the  IP  mode  have  distributions  in  both  ZnS  and  Si  layers  even 
though  there  is  no  ZnS  ionic  displacement  mode  in  the  Si  layers.  Examples  of  these 
mode  characteristics  have  been  demonstrated.  Neither  the  scalar  nor  vector  potential 
is  continuous  across  the  Si-ZnS  interface,  but  the  energy  of  coherent  interaction  with 
carriers  is  continuous  due  to  the  continuity  of  the  electric  field  parallel  to  the  interface. 

The  analytical  model  for  the  confined  optical  modes  consisting  of  polar  and  non¬ 
polar  optical  phonons  is  employed  in  calculating  the  carrier-phonon  interaction.  Our 
results  indicate  that  contributions  to  heavy-hole  intersubband  scattering  from  confined 
Si  and  ZnS  optical  phonons  strongly  depend  on  the  well  width  since  it  varies  the  dis¬ 
tributions  of  envelope  functions  of  involved  subbands  which  ultimately  determines  the 
intersubband  scattering  between  them  through  the  overlapping  interference  effect  G„„'- 
function.  For  small  Si  well  width  (<  30A),  the  scattering  rate  due  to  ZnS  optical  phonon 
is  stronger  than  that  of  Si  optical  phonons.  As  the  well  width  increases  the  scattering 
rate  due  to  the  Si  optical  phonons  surpasses  that  of  ZnS  optical  phonons.  The  scattering 
rate  dependence  on  barrier  width  is  relatively  weak. 
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Figure  Captions 


Figure  1.  Vibration  patterns  in  a  Si  layer  with  a  width  of  40 A  for  (a)  the  guided  s-TO 
mode,  (b)  the  ‘sine’  solution,  and  (c)  the  ‘cosine’  solution  of  the  guided  p-TO  and  LO 
modes. 


Figure  2.  Vibration  patterns  in  a  ZnS  layer  with  a  width  of  20A  for  (a)  the  first  type 
and  (b)  the  second  type  solutions  of  the  hybridized  LO,  TO  and  IP  modes. 

Figure  3.  Scalar  potential  distribution  associated  with  the  LO  modes  in  a  period  of 
the  Si/ZnS  superlattice  with  dsi  =  40 A  and  dzns  =  20 A  for  both  the  first  and  second 
types  of  the  vibration  modes. 


Figure  4.  Vector  potentials  associated  with  the  IP  modes  distributed  in  a  period  of  the 
Si/ZnS  superlattice  with  dsi  =  40A  and  dznS  =  20A  for  (a)  the  first  type  and  (b)  the 
second  type  of  the  vibration  modes. 

Figure  5.  The  field  distributions,  and  D^,  derived  from  the  scalar  and  vector 
potentials,  in  a  period  of  the  Si/ZnS  superlattice  with  dsi  =  40A  and  dznS  =  20A  for 
(a)  the  first  type  and  (b)  the  second  type  of  the  vibration  modes. 

Figure  6.  Intersubband  scattering  rates  due  to  the  emission  of  Si  and  ZnS  optical 
phonons  as  a  function  of  Si  well  width  {dsi)  for  a  barrier  width  of  dznS  =  40A. 

Figure  7.  Intersubband  scattering  rates  due  to  the  emission  of  Si  and  ZnS  optical 
phonons  as  a  function  of  ZnS  barrier  width  (dzns)  for  a  well  width  of  dst  =  30A. 
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ABSTRACT 


This  paper  describes  the  design  and  use  of  a  hardware  and  software  environment  for  investigations  in  array 
processing  (AP).  The  system  unites  the  numerical  processing  abilities  of  MatLab  with  the  real  world  through 
multichannel  data  acquisition  hardware.  Utilizing  MatLab’s  scripting  language,  MEX-files,  and  C++  code 
modules,  an  adaptable,  unified,  and  expandable  software  environment  has  been  developed  which  can  be  readily 
modified  for  the  equipment  at  hand.  This  modular  software/hardware  environment  not  only  improves  the 
ability  of  researchers  to  refine  AP  techniques,  but  assists  in  the  integration  of  these  tools  into  other  technologies. 
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Figure  1:  Simplified  Block  Diagram  of  AHTAPA 
Structure. 

1.  INTRODUCTION 

The  field  of  array  processing  encompasses  a  large  number  of  algorithms  and  applications.  To  begin  the  task 
of  developing  a  general  purpose  hardware  test-bed  for  experimentation  in  this  field,  it  is  appropriate  to  narrow 
the  focus  to  some  subclass  of  algorithms  with  interesting  properties.  In  the  past  two  decades,  a  plethora  of 
techniques  have  been  spawned  for  adapting  filters,  demodulators,  and  receiver  arrays  to  extract  unknown 
signals  of  interest  from  corrupted  data  signals.  These  techniques  have  come  to  be  referred  to  as  blind  source 

separation  (BSS).  Examples  of  its  main  categories  are: 

♦  Coherence  exploitation  techniques:  utilizes  higher-order  spectral  analysis  and  cummulants,  and  self¬ 
coherence  restoration  algorithms  em-ploying  known  spatial  spectral,  or  self-coherence  properties  of  the 
transmitted  signal  [1,23,4]. 

*  Modulus  restoration  techniques:  utilize  known  modulus  properties  of  the  transmitted  signal  [5,6]. 

•  Anticipatory  techniques:  utilize  maximum  likelihood  and  burst-acquisition  methods,  which  exploit 
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known  temporal  support  of  the  transmitted  signal  [7]. 

The  diversity  of  approaches  to  blind  source  separation  leads  to  discussions  of  the  relative  merits  of  different 
approaches.  Each  approach  has  circumstances  to  which  it  is  better  suited  than  another  approach,  and  sometimes  a 
combination  of  techniques  might  be  in  order.  The  ‘‘best”  algorithm  for  a  given  situation  can  be  difficult  to 
determine.  Also,  the  development  and  testing  of  AP  algorithms  in  a  real-life  scenario  can  be  very  difficult,  and  time 
consuming  as  tools  are  reinvented.  This  underscores  the  need  for  a  unified,  adaptable  computer  environment 
utilizing  all  the  tools  and  information  at  hand  —allowing  rapid  implementation  and  analysis  of  any  AP  algorithm. 
This  paper  presents  such  a  computer  environment. 

2.  AHTAPA:  UNDERLYING  ARCHITECTURE 

An  environment  fulfilling  the  above  mentioned  criteria  is  depicted  in  figures  1  and  2.  The  setup  consists 
of  a  host  workstation  which  provides  overall  control,  multichannel  data  acquisition  cards  which  collect  data 
from  multiple  transducers  (microphones,  antennae,  or  light  sensors,  etc.),  a  Pentium  Based  PC  for  data  collection 
and  control,  and  assorted  peripheral  devices  that  facilitate  input,  output,  control,  and  signal  conditioning. 

The  host  program  provides  an  interface  to  the  array  processing  modules,  to  data  analysis  and  file  management  tools, 
and  to  other  utilities  that  may  be  needed  -  thus  unifying  the  entire  research  and  development  process  into  a  single 
application.  A  simplified  block  diagram  illustrating  AHTAPA’s  modularity  is  shown  in  Figure  1 . 

AHTAPA  is  a  MatLab  script  which  interacts  with  other  MatLab  scripts  and  C++  subroutines.  In  essence,  the 
MatLab  host  script  spawns  a  user  interface  serving  as  a  front  end  and  organizer  to  the  other  programming  modules. 
New  modules  can  be  added  in  a  simple  ‘‘plug-n-play”  fashion.  Keeping  in  mind  the  goal  of  unifying  the  research 
and  development  process,  a  key  aspect  of  the  project  is  the  continued  development  of  a  library  of  categorized  array 
processing  algorithms  which  can  be  expanded  upon  as  needed,  for  comparison  and  contrast,  and  analyzed  w'ith  a 
libraiy  of  graphical,  and  anaKtic  tools. 
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Figure2:  Overview  of  array  processing  environment,  and  block  diagram  of  Hardware 
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3.  ARRAY  HARDWARE 

The  collection  of  real  world  data  is  dependent  upon  the  user’s  situation  and  resources.  For  example,  during  the  first 
stage  of  this  project’s  development,  an  external  analog  to  digital  converter  (the  WaveBook  5 12,  manufactured  by 
lotech)  with  digitall  I/O  was  utilized.  The  unit  has  eight  diferential  inputs  and  a  maximum  sampling  rate  of  1  MS/s. 
The  Wavebook  512  was  connected  to  an  array  of  microphones,  each  of  which  had  its  own  amplifier.  The  digitized 
data  was  then  sent  to  a  laptop  where  file  handling  and  analysis  was  conducted  through  AHTAPA. 

Eventually  a  dual-processor  Pentium  based  PC  was  dedicated  to  data  acquisition  and  contrl.  The  system  utilizes  two 
PCI-based  data  acquisition  cards  produced  by  National  Instruments  (PCTMIO  Series).  Each  card  has  a  maximum 
sampling  rate  of  1  MHz  at  a  12  bit  resolution,  and  eight  differential  channels.  The  computer  has  enough  memory  to 
acquire  over  10  minutes  of  uninterrupted  data  from  eight  channels  simultaneously.  Sample  rate,  nuber  of  activated 
channels,  and  data-resolution  can  be  adjusted. 


Thye  array  system  has  been  designed  to  allow  for  both  real-time  signal  processing  of  real  data,  and  for  off-line  data 


analysis  and  algorithm  development.  A  number  of  subsystems  are  incorporated  into  the  current  environment  and 
include  an  anechoic  chamber,  a  host  workstation,  a  signal  generatin  and  broadcasting  system,  a  signal 
collection/processing  system,  and  miscellaneous  peripherals.  Figure  2  gives  a  block-diagram  overview  of  the  current 
hardware  components  and  their  interconnections. 


However,  a  sophisticated  setup  is  not  a  re<}uirement  for  AHTAPA.  The  modular  construction  of  AHTAPA  allowas 
a  user  to  easily  adapt  AHTAPA  to  individual  needs  and  resources  -  linking  unique  code  modules,  from  the  simplest 
game  port  input  setup  to  the  most  elaborate  DSP  acquisition  card.  The  program  also  comes  with  code  modulaes,  and 
instructions  that  could  be  used  by  a  researcher  to  utilize  common  input  devices  to  acquire  real  world  data  (at 
minimal  cost)  -  such  as  serial,  game,  and  parallel  ports. 

4.  CONTROL  SOFTWARE 

With  the  current  arrangement,  the  host  machine  running  AHTPA  initiates  data  collection  and  control  on  the  PC  over 
the  network,  using  a  well  established  client-server  model.  Aserver  running  on  the  PC  responds  to  requests  issued  by 
a  client  program  running  on  the  host  workstation.  The  two  exchange  instructions,  and  the  collected  data  is  shared 
via  an  NFS  network  drive.  ^Tiile  any  program  supporting  sockets  can  do  this  MatLab  makes  an  excellent  client. 


5.  DATA  ANALYSIS  ISSUES 

The  successful  application  of  any  AP  algorithm  requires  an  understanding  of  the  system  from  which  the 
data  was  collected,  as  well  as  how  it  was  collected.  For  example,  intuition  might  lead  to  the  assumption  that  the 
sound  received  by  a  microphone  from  multiple  sources  is  a  simple  additive  mixture.  However,  this  is  not  the  case. 
The  sound  “heard”  at  a  point  from  multiple  sources  is  a  convoluted  mbcture.  Another  example  is  signals  in  die 
electromagnetic  spectrum,  be  it  radio.  X-rays,  or  visible  light  Intuition  gives  a  fairly  accurate  picture  in  this 
case — that  electromagnetic  phenomena  form  a  simple  additive  mixture.  But  there  are  exceptions  to  this  as  well, 
such  as  radar  pulses  through  mrbulent  water,  or  visible  images  projectd  through  glass  with  local  impurities  and  an 
uneven/warped  surface. 

The  manner  in  which  the  data  is  collected  is  also  important.  Under  some  circumstances  the  data  collection  process 
can  introduce  a  convolution  where  there  normally  would  not  be  one.  For  example;  when  a  geological  survey  of  the 
ocean  bottom  is  conducted,  sonar  equipment  is  sometimes  dragged  along  with  the  ship,  this  can  generate  turbulence 
around  the  sonar  array  that  may  introduce  an  additional  convolution  to  the  data  being  collected.  Also,  a  particular 
AP  algorithm  may  be  sensitive  to  array  geometry.  A  simple,  generalzed  example  of  this  problem  is  illustrated  as 
follows. 
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Suppose  there  are  two  sources  emitting  a  signal  with  wavelength  x.  If  the  sensors  receiving  the  signal  are  a 
distance  0.03x  apart,  they  could  be  behaving  as  a  ‘‘single”  sensor.  The  needed  information  to  separate  the  signals 
from  die  mixture  would  then  be  lost. 

The  above  are  just  a  few  examples  of  the  many  issues  that  must  be  taken  into  consideration  when  collecting  data 
for  AP  analysis —  and  in  deciding  upon  a  particular  AP  algorithm  for  a  given  situation.  As  with  any  analytic  tool, 
be  it  Fourier  analysis.  Wavelets,  or  Time  Frequency  Domain,  knowledge  of  a  method’s  strengths  and 
weaknesses  is  important,  so  that  easy  problems  are  not  made  difficult  and  mistakes  not  misunderstood  as 
solutions. 

6.  SIMULATIONS 

The  AHTAPA  environment  allows  for  easy  experimentation  with  algorithms,  for  example  here  we  present  a 
simulation  that  used  an  algorithm  which  was  a  combination  of  ^proacheds  from  Belouchram,  Cardoso,  neural 
networkd,  and  some  trial  and  error  experimentation.  The  simulation  is  performed  using  measured  source  signals, 
mixed  with  a  realistic  mixing  matrix,  estimated  from  experiments. 

Using  this  new  hybrid  algorithm,  with  a  constant  learning  rate,  some  interesting  results  were  obtained  and  are 
depicted  in  figures  3, 4, 5, 6,  and  7. 
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Figure  3  The  pre-procedded  source  signals  xi,  and 

Figure  4.  The  observed  signals  el  and  e2+4  after  mixing 
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Figu  re  5.  Output  signals  s  1  and  s2+4  during  separation.  Figure  6.  Convergence  of  c  1 2  and  c2 1 .  The  correct 

Solutions  are  the  horizontal  lines 
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Figure  7.  Gap  values 


From  these  results  we  can  see  that  the  coefficients  converge  rapidly  toward  their  correct  values;  the  output  signals 
gradually  regain  the  source  signal  shape;  the  gap  is  decreasing,  but  a  slow  oscillation,  with  about  four  periods  is 
clearly  visible  in  the  graph;  this  oscillation  can  be  seen  in  the  convergence  patterns  of  the  coefficients. 


This  oscillation  might  be  due  to  a  dependence  between  the  source  signals  xi.  Some  of  the  causes  for  this  coupling 
might  be  due  to  the  following  -Electromagnetic  coupling  between  receivers;  oscillation  caused  by  small  difference 
in  frequency  of  the  transmitted  signals  -  which  causes  the  patterns  of  both  signals  to  slowly  shift  with  respect  to 
each  other. 

Since  this  hybrid  algorithm  determines  the  most  independent  components  in  a  set  of  signals,  we  expect  that  the 
algorithm  must  be  able  to  find  this  dependence.  To  verify  this,  the  experiment  was  repeated  using  a  unitary  mixing 
matrix,  so  that  the  mbc  equals  the  source  signals.  If  the  signals  are  fully  independent,  no  action  should  occur  from 
the  network  of  the  algorithm.  However,  a  dependence  is  shown  in  figure  8.,  where  a  fluctuation  is  seen  of  the 
coefficient  around  zero.  The  resulting  output  signals  are  the  most  independent  components  of  the  original  source 
signals  xi. 

Repeating  the  separation  for  mixed  signals,  using  the  most  independent  components,  one  would  expect  the 
convergence  of  the  coefficients  to  be  less  distored  by  the  low  frequency  oscillation.  This  is  verified  in  figure  9. 


Figure  8.  Convergence  of  cl2  and  c2H‘.2  for  the 


7.  CONCLUSION 

An  adaptable,  modular  environment  for  experimentation  in  AP  algorithms,  based  on  MatLab  scripts  and  C++  code 
is  presented.  The  environment  developed  can  be  utilized  by  any  researcher,  and  is  adaptable  to  needs,  and  resources. 
The  goal  is  that  the  work  presented  here  may  be  developed  into  a  standard  tool  for  AP  work — streamlining  the 
development  and  testing  process  as  well  as  assisting  in  integrating  AP  technologies  into  other  disciplines. 
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